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The ground state energy of the Bose particle system at low densities is investigated by using 
the method which is based on summing up the terms of the conventional perturbation series givirg 
rise to the lowest order with respect to the density. The summation is shown to be greatly simplifi- 
ed if the scattering matrix is introduced, and a simple example is demonstrated for the repulsive 
square well potential by calculating a few terms of the series first directly and secondly with the aid 
of scattering matrix. The final result is shown to be valid both for weak and strong interactions and 
the calculation is carried out analytically for some types of potential function. The connection of the 
present method with the pseudopotential method and with the Brueckner theory is investigated and 
some problems associated with the attractive interactions are discussed. 


§1. Introduction 


In some problems of quantum mechanics or statistical mechanics a method is employed 
which is based on the summation of the important diagrams in the perturbation series’. 
In this paper we wish to investigate the ground state energy & per particle of the Bose 
system along this line. 

The quantity €, naturally depends on the density y” of the system, on parameters 
characteristic of interatomic potential, etc. Now, let us imagine to plot & versus / curve 
and seek the behaviours of the curve near at )—0. At p=0 the system is essentially 
assembly of free particles and it follows &(”=0)=0. Therefore it is sufficient to calcu- 
late (0€,/9/),-0 for our purpose. In other words, our problem is the idealized one to 
calculate the term linear in when &, is expanded in terms of /’. 

In § 2 the formal perturbation series in powers of the interatomic potential is discus- 
sed and the diagrams giving rise to the lowest order term of the ground state energy are 
identified to be those corresponding to the virtual processes in which the pair creations 
and annihilations take place successively. In § 3 the summation of the diagrams is carried 
out with the aid of the free-particle scattering matrix. A simple example of this procedure 
is demonstrated for the repulsive square well potential ; first, a few terms of the series 
are calculated by direct integrations in § 2 and secondly by the use of scattering matrix 
in § 3. Both methods yield the same result for the terms of the series up to the order 
investigated in this paper. The lowest order term of the energy is calculated analytical- 
ly for some types of the interatomic potential, and the problem associated with the pre- 


sence of the bound states is discussed. In § 4 the perturbation series in terms of the 
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interatomic potential is transformed into the one in terms of the scattering matrix and it 
is shown that the diagrams summed up in § 2 do not appear in the new series and that 
an application to the hard-sphere potential leads to an expression obtained by pseudo- 
potential method. The alteration of the energy denominator proposed by Brueckner and 
Sawada” is interpreted in an elementary way from the view-point of the present paper. 
It is suggested that an appropriate variational procedure is necessary to deal with the 


Bose system with attractive but saturating interactions. 


§2. Perturbation series for the Bose system 


Let us discuss in this section the behaviour of the formal perturbation series in 
powers of the interatomic potential for the Bose system. We assume that the unperturb- 


ed Hamiltonian is 


I= /2m-Dik,*a,* a, (1) 
« 


and the perturbing Hamiltonian is 


Ib! =P? /2m-1/2V - Sv (af, 47) 4,*a,*a,4, . (2) 


“pay 


The operators a* and a are the usual creation and annihilation operators for the Bose 


particles. The matrix elements v(a/3, 47) are taken in the momentum representation, 1.€., 
. 1 al : . k : 
v(a@f, 47) pay exp (—ik,:X,—ik,-xX,)v(1,2) exp (ik, -x,+ ik, -x,) dx,dx, 


=0(ke+ k,—k,—ky) v (Kas, ky;) (3) 


where V is the normalization volume, b*/2m-«(1, 2) the two-body potential, ky;= (ka— ks) / 2, 
ky,= (ky, —k,) /2, and v(q, q’) is defined by 


v(q, q’) =|exp (—iq:-x) v(x) exp (iq’-x) dx. (4) 


In the following we shall consider the non-singular two-body potentials so that the 
perturbation method can be applied. To deal with the case of a singular potential such 
as a hard-sphere one, it is still convenient to work directly with the matrix elements v(a/, 77’) 
by adopting some appropriate cutoff procedure. We shall prove later on that the final 
result is independent of the choice of such procedures. 

Now, each term of the perturbation series can be written as an integral over various 
vectors q;, which are virtual momentum transfers. The successive terms of the series 
are : 


E,=(0|26'|0) =62/2m- N(N—1) /2V-v(00, 00) 
=h/2m. N(N—1) /2V-1(0, 0) (5) 


where N is the total number of particles. The second-order energy is 


Ee Tere as / _ ip 
(o|I6 mere #6'|0) 
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INN e= 1) ol v(00,— qq) +v(00,q— q) 
2m (2)? (227°) 


-v(q—q,00) 


2 \ 
EE NNN SLE C0.) G0) (6) 
2m (2V)?> am (—q?) 
where use is made of eq. (3). In order to ‘proceed to the higher order terms, it is con- 
ve Hi venient to introduce a graphical 
¥ AA / representation of the virtual pro- 
we 
Ni 7 cesses as was used by Brueckner 
\ Ye and Sawada’. In Fig. 1, we 
Fi : ; ; 
Ne / give the diagram which repre- 
2 i be sents a process giving rise to 
\ By 
Then, the diagrams corres- 
Ws \ : 
y \ ponding to the processes that lead 
if 2 i 
\ POS to the term linear in ¢ are easily 
Fig. 1 Diagram contributing to E. oa ~o identified. Consider, for ex- 
ate exited —- eee Fig. 2 ample, the diagram shown in Fig. 
dicated by solid lines and ‘ 
Fie mien red parties toy 2. The process corresponding to 
dashed lines. this is characterized by the annihi- 
lation or creation of the unexcit- 
ed particles (particles with 0-momentum) in the intermediate states and 
i. i : *a *a *aa¥* 4 
N A, therefore the energy term contains a factor (0|a,*a,*a,*aya,*a,a,a)|0) ~N 
\ in this case. Now the energy should be proportional to N and this suggests 
that the term under consideration has the form Np* and is of the higher 
orders with respect to (the factor V~* comes from J¢’ and the 
‘ sum over intermediate momentum states). We see from this argument 
| that the lowest order term in ¢ is such that the annihilation or cre- 
ation of the unexcited particles must not occur except for the initial or 
final state. Then the diagrams having this character are easily written 
down. Since the annihilation of the unexcited particles does not 
KE A occur in the intermediate states, in each stage of the virtual processes 
the pair with momenta q and —q should be destroyed and excited 
N —h to the states (q’,—q’), etc., and finally returns to the ground state. 
ae Thus the diagrams interesting to us will be of the type shown in Fig. 
Le pees 3. Then the energy term in the n-th order corresponding to this pro- 
7 : 
Fig. 3 cess is easily shown to be 
BN(N=1)_ sv {v(00,4n-1— Gr-1) +. 0(00, = Gn Int) fo" 10 (= Io = WH) + 
2m (2V) i ment fo 2g.) fe 2Ghes:) pegs (2050) 


U(Go— qo, — qi) }v(Gi— 91,90) Gy 
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i be th 
where SY means that q,40, q2¥%0, °°, Qu-1~O- Therefore, if we define E, to e 


lowest order term of the energy, we have 


ee ee, 0) =- 1 SY y(0, q)¥ (Ip 0) 
4mV (2V) I qi 


+ al al y(0, qo) ¥ (os, qv) v(q;,0) 
(2V)* %14. q2 91° 

fads 3A ants y(0, Gn—1) ¥(Gn—1» qn—2) e wits fa (q:, 0) 4+ ae 2 ' : 
(2V) "9 tot, (=a) (Ha) (—9) 


(8) 


Here we have carried out the change of variables q;>—q,, q2—->— 2, etc., in eq. (7) 
and used eq. (3). 

Before discussing the summation of (8), we wish to calculate by direct integrations 
a few terms of the series for the repulsive square well potential as an example. 

For the potential 


v(x) =% (u>O0), r<a, (r=|x\), 
=0, r>a, (9) 
v(q, q’) is given by 
v(q, q’) =47xu,47)j, (ka) /k (10) 


where j,;(x) is the spherical Bessel function of order 1 and k=|q—q’|. Then we have 
v(0, 0) =4zv,a*/3 , 


1 Sv v(0, 1) ¥ (In 0) =4y,2a" (" ir 4) = . 4zvea" 


- (27). t q:° Jog 15 


and 
—s LS! v (0, GW)» (Fx 1) ¥ (4s, 0) 
(2V)? 17, qq 
— 2'a" \.- hi (14) a, nee (get) sin 4(9;— qo) dq, 
eae) SOs pe: qs (qi— 4s) 
= 47y,°a"- 17/1260. 
Here we have used the Weber-Schafheitlin integral for the Bessel function’. Using above 


equations, we have from eq. (8) 


Eye i Ut aya 417 via’ +} (11) 


N m 6 30 2520 


up to the order of 7°. The further calculation is cumbersome and it is difficult to find 


the general term of the series. A convenient method to calculate the series will be given 
in the next section. 
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§ 3. Summation of the series 


In order to sum up the series (8), it is convenient to replace the problem of sum- 
mation by an equivalent and a simpler one. For this purpose we wish to proceed to 
the more general case where the unperturbed state is represented by the occupation numbers 
n,’s, not necessarily corresponding to the ground state. Then it is easily verified that the 


energy term arising from the diagrams discussed in § 2 is given by 
Ey(m) =—— 131 u(aB, Hy) (mglag*ay*aza,|m) (12) 
Dai OV ae ? tied baa hk \ 


where 


ee alae) ee ma 1) 
i Te SO 


nile Laiy = v(ap, A5[3y) V (Aa/Fo, @,8,) (ah, /y) 


pak Sos 5 STS a aaa hee 13 
V* wats (ky! hy? — hag! haa!) (bx? ty? — hay? — hy) a 


In eq. (13) S$)’ means that the values of k,,, ks, etc., which make the energy deno- 
minator vanish, are excluded from the summation. One may verify that eq. (13) reduces 
to eq. (8) in the case of ground state, i.e. n,=N and n,=O0 (k 0). 

Now, we note that the matrix elements u(a@/, 47) satisfy the equation 


u(or,ay) 


fi Pp 2. k oy =i 2 =U ag, 4y), (14) 
a Th So. oes 


v(af, Ar) += SY ola, oT) 


which can be proved from the defining equation for u(a/, 47), eq. (13). This equation 
implies that u(@/, /7) is just the matrix element of the free-particle scattering operator’. 
In order to transform this equation in the coordinate representation, we consider the wave 
function /,, (1, 2) defined by 


: } [Ps ik: X%,+ik,-x, 
Py, (1, 2) =exp (ih, -%,+ik, + x2) + ia = ar a sa : 
WY la] Tv 


ri 


x exp (—ik,-x,!—ik,-x,/)u(1'; 2’) By, (1, 2’) dx,'dx,' . (15) 


If we multiply this equation by exp(—ik,-x,—ik,-x,)v(1, 2) and integrate by x, and 


x, in the volume V, we see that the resulting equation is just identical with eq. (14) 


provided that u(a@j, 47) is given by 
u(ap, Ar) =- 7 exp (—ik,:*x,—ik,-x,)v(1, 2) Pi,(1, 2) dx,dx, . (16) 
Or, if we put 


P,..(1,2) = exp [i(ka +k,) -X]o,, (x) 


where X is the center-of-mass coordinate |= (x,+.4,)/2| and x the relative coordinate 
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(=x,—x,), then we have 
u(a/s, 77) =0 (Ieg+ ky — Ik, — ky) exp (—iltas“*) u(x) @,, (x) dx (17) 


and 


Py, (*) =exp (tka: x) 


i Boag Bed AU i [exp (— ies ®')0(®) Ga 2) A (18) 
ZV" hye ae he 


If we operate p® on eq. (18), we have 


ea +22 |o(x) ee Tae \ expliq- (x—x’) u(x’) ¢(x’) dx’. 


(19) 


This is the Schrédinger equation for the scattering problem except for the second term 
on the right-hand side. The appearance of this term is due to the finite volume of the 
system and consistent with the periodic boundary condition imposed on g(x). This will 
be seen in the following way; if we multiply eq. (19) by exp(—igqy-x) with qy satisfy- 
ing the periodic condition, we have 


(qo? — Fay”) exp (—igu-«) ¢(x)dx= 1/2-[O (Kay, Go) — 1) | exp (—iq,:x) v(x) ¢(x)dx. 


From this equation it follows that the appearance of the second term is consistent with 
the boundary condition, since, if this term is missing in eq. (19), then it follows 
jexp(—iqy-x) v(x) g(x) dx=0, for go=hars which is not always valid for an arbitrary value 
of k,, satisfying the periodic condition. In the limit V—»co, however, this term may be 
neglected and eq. (19) reduces to the usual scattering equation. 


Returning now to the summation of (8), we see that E, is given by 
E,/N='0/4m-u (00,00). (20) 


In calculating u(00,00) it is sufficient to consider only the spherically symmetric 
solution of eq. (19). Also the solution should approach 1 for r==|x|—>0o, for 
vy (x)—exp(ik,,-x) at large r and k,,=0 in this case. Furthermore, the second term 
on the right-hand side of eq. (19) may be neglected as we have mentioned above. 

From these considerations it follows that 


uv (00,00) =42("o(0) R(r) Pdr (21) 
where R(r) satisfies the equations 


[1/r?-d/dr (r°d/dr) —v(r) /2|R(r) =0 (22a) 


and 


R(r)>1, (r>00). (22b) 
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Furthermore, R(r) should be regular at r=0. 
From eqs. (20), (21) and (22a) we have 


E, ‘N= 27h? /m ¢ (r°dR/dr) n=O * 22) 


We see that eqs. (22a), (22b) and (23) are applicable to the strong interactions 
and identical with those obtained in the previous paper”. For the repulsive square well 
potential considered in § 2, the solution of eq. (22a) is 


R=1+A/7, (r>a) and R=Bshj/v,/2-r/1,, (r<) . 
From the condition that R(r) and R’(r) should be continuous at r=a, we have 
Be ore chi af ed 
Therefore from eq. (23), we have 
E,/N=2z7'p/m- {a—)/2/% th a/v,/2}. (24) 
If we expand thx as 
th x=x— 1/3-x°42/15-x°— 17/315-x’ +--+ 


the result is easily shown to be identical with eq. (11) obtained by direct integrations. 

It should be noted here that eq. (24) remains finite as y,—0o. On the other hand, 
if thx is expanded in powers of x, each term of the series is divergent in this limit. 
This feature of the case of strong interaction implies that an appropriate procedure of 
summing up all the divergent terms of the conventional perturbation series is necessary 
to obtain a finite result. For the potential under consideration, the limiting case is just 
the hard-sphere one, and eq. (24) is naturally reduced to the formula obtained by Lenz”. 
We wish further to remark that the discussions developed in this paper are also appli- 
cable to the weak interactions for which v(q, q’) is finite. In this case, each term of 
the perturbation series may be finite. However, the result is meaningless if the series is 
terminated in the finite order terms as far as the density dependence of the energy is 
concerned. Therefore it follows that the summation of diagrams is necessary even for 
weak interactions if one wishes to investigate a correct functional relation of the energy 
with the density. 

In conclusion we wish to calculate E,/N for some types of potential function. 


(a) Attractive square well potential with the hard-core repulsion 


In this case v(x) is given by 
Uae, rad, 
ok) =—%Uy, (>), a<r<atd, 
u(x%)=0, atd<r. 
It is easy to see that 


Fo = PEE fat d—/ 2 tan ,/v,/2 -d]. (25) 
m 
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(b) Sutherland potential 
Two-body potential V(r) is defined by 
V(n)=m, <a, 
Vir) ==—V, (a/)*, Ve>0), 1 > 4. 
For this potential the solution of (22a) is given by the Bessel function for 1 >a, and 


the boundary conditions at r= co, eq. (22b), and at r=a, R(a)=0, uniquely determine 


the solution. Then we have 
Ep ait OS ae 


N m. I'{(n—1)/(n—2)} \ n—2 / Jinn» (2ka®-® / (n—2)} 


(26) 


with =m ,a"/b° . 
In particular, for the van der Waals attraction, n=6, eq. (26) reduces to 


(26a) 


Ey 2 G74). ef mV , ct J-sja{ (nV)? a /26} 
N m PGyay 82. Jia) (nV)? / 26} 


In the limit V,—0, eq. (26a) becomes Lenz’s formula” for the hard-sphere potential. 
(c) Lennard-Jones (6-10) potential 
This potential was discussed in the previous paper’, so that we only give the result : 
3/4 / a ran 
lo my 2 2 PG, HG, 8 7) m-7!8 718 GSI BM : (27) 
N (1/4) (3/8—7) 


It should be noted here that the E,/N becomes —co if the bound state with zero 
ehergy appears in the two-body system. For example, if 1 (%/2)d is equal to 7/2 in 
eq. (25), E,/N=— oo. The above condition for v and d just corresponds to the ap- 
pearance of the bound state with zero energy. In this case E, N versus ~ curve coin- 
cides with the negative E,/N axis near ¢=0. The similar situation exists for the poten- 
tial functions (b) and (c). In the case of (b), (we take n=6 for simplicity), as V, 
increases from 0, E,/N decreases from the positive value until it becomes — oo when 
(mV,)'? a/2b=<¢, where ¢ is the least positive root of J,,4(¢)=0. Similarly, for the 
potential (c), E,/N becomes —co at 7=5/8. It seems, therefore, that the behaviour of 


the energy versus density curve is markedly dependent on the presence of the bound state 
in the two-body problem. 


S$ 4. Perturbation method in terms of scattering matrix 


In the previous sections we have discussed the summation of the diagrams giving 
rise to the lowest order term of the energy. If one compares eq. (5) with eq. (20), 
one may easily see that E, is just the first order energy with v(a9, 47) replaced by 
u(aj, Ay). This suggests that the potential function is effectively replaced by the scat- 
tering matrix. In this section we wish to discuss the perturbational treatment of the 
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scattering matrix. 

Let us first consider the meaning of eq. (14). Originally this equation was introduc- 
ed for the purpose of summing up the series (8) and u-matrix is explicitly expanded in 
terms of »(q’, q) as in eq. (13). However, we may regard eq. (14) conversely as the 
defining equation for »(q’, q) in terms of u-matrix. Then we have, writing u(af,/y) = 


u(q’, q) with k,g=q’ and k,,=q [see eq. (17) ], 
sya, Tat (q", q) 


v(q’,q) =u(q’,q) +— 


- ra q/?@—¢ 
| Sy u(q’, q)u(q”, q”’)u(q’,q) BS siete : (28) 
(2V)? qigit! (q/"—q’) q’’?—aq!””) 


If one substitutes this equation in the perturbation series in powers of y-matrix, the series 
will be transformed in the one in powers of scattering matrix. For example, if one re- 
tains only the first term in eq. (28), the first order energy will be given by eq. (20). 
As is well known, the scattering matrix does exist even for the strong interaction and 
therefore it may be expected that the series in terms of u-matrix is valid for such a case. 
Furthermore, the following feature of the series may be expected: since the diagrams, in 
which the creation or annihilation of the unexcited particles does not occur except for the 
initial or final state, are included already in the u-matrix, such diagrams will not appear 
in the u-matrix series. In other words, only the diagrams corresponding to the process 
of the creation or annihilation of the unexcited particles in the intermediate states will 
appear in the new series. That this is the case will be verified up to the third order 
in the following. 

The first and the second order energy were given by eqs. (5) and (6) and the 


third order one is easily shown to be 


pam Peay WN 1) SH v(0, q’)» v(q', we OE SSO Avi 0) 
2m (Ae CU qq” q q 
x {2(N—2)[»(q/2, 4/2) +»(—4/2, 4/2) |— (2N—3)¥(0, 0)} | (29) 


If we substitute eq. (28) in this expression and in eqs. (5) and (6) and express 


the perturbation series in terms u-matrix, we have 


je o IN UN =l) | (Os0) ape ea u(0, Du(4, 0) 
2m 2V (2 a q 
x {2(N—2)[u(q/2, q/2) +u(—q/2, 4/2) |— (2N—3) u(0, 0)} | (30) 


up to the third order. It should be noted here that the diagrams corresponding to Fig. 
3 no longer appear in eq. (30), as we have expected. The similar feature of the new 


series may be true for any higher order terms. 
If we consider the hard-sphere system and set u(0,0) to be 87a, neglecting the 


dependence of u-matrix on q, we have from eq. (30) 
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E _ ,2zabio E 


2 1 
eG} Fob | 
\ N m x7? 


(2+ m?+n®)? 


a 


(V=L*), which is consistent with the result obtained by Huang and Yang” with the 
aid of pseudopotential method. Therefore we are able to conclude that the pseudopoten- 
tial method is equivalent to the perturbation theoretical treatment of the free-particle 
scattering matrix. However, as was pointed out by Brueckner and Sawada", the pertur- 
bation method is not appropriate in this case, for the energy term corresponding to the 
diagram shown in Fig. 4 is divergent. To avoid this difficulty they have summed up the 
diagrams of this type up to infinite order. This process seems to be quite reasonable if 
the v-matrix formalism is employed, since the diagrams appearing in 
See the new series must at least once create or annihilate the unexcited 
particles in the intermediate states. Indeed, as we have shown pre- 
y viously", this formalism leads to the exact formula obtained by Lee, 
Huang and Yang'” which is appropriate for the low density limits. 
<. Therefore one may conclude that the free-particle scattering matrix 
formalism is exact as long as the density of the system is sufficiently low. 
HES At higher densities, however, the free-particle scattering theory 
/ s will become poor, since, in this case, the many-body effects of the un- 
i excited particles on the virtual pair will not be neglected. In order 
Fig. 4 Divergent ; 
diagram. to deal with these effects, Brueckner and Sawada* have altered the 
energy denominator so that the Green function describes the propagation in the presence 
of the unexcited particles. This propagator alteration may be interpreted in the follow- 
ing way. Also this procedure will provide a clue to solving the difficulties associated 
with the attractive interactions. 
As we have done in § 2, we start from the weak interaction, i.e., we consider the 
Hamiltonian 


Bones Baad : 
= 2a,*a, + -S) v(af, 47) a,*a,*a, 
2 ae Ca Ce am 2 oie ( Py, 7) a 4, 4-4), (31) 


where v(a@, Ay) is assumed to have a definite value. In § 2, the first term (kinetic 
energy) was considered as the unperturbed Hamiltonian and the second term as the per- 
turbing one, and the multiple interaction of a pair of particles excited from the ground 
state was treated. Then we were led to the concept of free-particle scattering matrix. 
Hence, it will be necessary to choose another way of separation of the Hamiltonian into 
the unperturbed and the perturbing one, if one wishes to take into account the effects of 
the unexcited particles on the propagator. The simplest way of this separation will be 
such that the parts of the interaction Hamiltonian which are diagonal in the number re- 


presentation are included in the unperturbed system. That is to say, the unperturbed. 
Hamiltonian is considered to be 


5 me, 
My | Dikeaatan +L Sio(ap, af) +u (ap, Ba) }ac%ay*aca, 


2m 


wah Ae 


Ba Si 
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+> dv(aa, ad) a,4,*4,2,| (32) 


and the remaining part is assumed to be the perturbing one. 
Now consider the unperturbed wave vector |n,, 1,, <-+, mj, --- >. The unperturbed energy 
corresponding to this is 


’ pe = 4 
E, (n;) =— [> kang + : DV{v (af, aj?) +v(a@p, fa) } Ny Ne 
2m\« 2V «xB 


m 
+ dv (aa, aa) ng(,—1) | (33) 
Therefore if one repeats the procedure discussed in § 2 and § 3, the energy denominator 
in eq. (13) will be altered in the following way: 
b?/2m- (ky? +ko—k?—kg’) DE, (m, m,°--) —Ey (mg, +:ta+1, mg+1, m,—1, 2,—1) 
#2 


m™ 


{E(A) + E(7) — (a) —€(8)} (34) 


where 


EV) =? 4 Mw (Ai’, I) 0, MA) Yn — Lo (, Amy. (35) 
Vou V 
As we have remarked in the previous sections, the matrix elements v(a/%, /7) can 
be replaced by u(a@/9, 47) in the case of strong interactions. Therefore, if we replace v’s 
in eq. (35) by w’s, using eq. (14) we have 
u(or, /7) 


= =4 AG 36 
E(4) +E(7) —€ (a) —E(z) Cele oe 


v(@B, ix) + Sv (a8, 07) 


as an equation to determine the u-matrix. Here €(/) corresponds to the energy of a 


particle interacting with the unexcited particles and is given by 

E(A) ye ees. {u(dM’, 47’) +u(44, HA) my, a, (AA, 2A) n, . 

EST: V 
For the ground state, i.e., for m= N and n,=0 (k¥0), the energy denominator is given 
by 
G(o, 7) = {€(0) —E(o) +&(0) —&(z))} > 
= — {k2+p[u(o0, 70) +u(o0, 07) —u(00, 00) | 
+k2+p[u(z0, 70) +u(z0, Or) —u(00, 00) |}. (37) 


It should be noted here that this Green function together with eq. (36) is just 
the fundamental equation for the ground state of the Bose system in Brueckner and Sawada’s 
theory [see eqs. (2) and (3) of reference 4), p. 1128]. 


We have shown above that the propagator alteration is carried out by a proper choice 


of the separation of the total Hamiltonian. This view-point is also useful = see how the 
situation is in the case of attractive interactions. By the attractive interactions we mean 
that the (00,00) is negative. In this case the sound velocity becomes imaginary if one 
formally applies the formula for the excitation energy spectrum” 


2 y12 
; a 


bo, = (6/2m) {(q?+ 0 {u(Oq, 0g) +u(0q,q0) —u(00,00) } P—/# (00, q— gq) 
(38) 


For in the limit q—>0, eq. (38) becomes 
bw,= (b?/2m) q {2eu(00, 00)}"*. 


Thus, the excitation energy becomes imaginary if u(00,00)<0. Such a feature is, of 
course, physically meaningless, hence it represents a breakdown in the approximation employ- 
ed rather than a physical effect. In order to clarify this point, we first note that the 
assumption that nearly all the particles are in the single particle state with momentum 
0 was used in deriving the excitation energy spectrum®’’*’. This assumption may be valid 
for the purely repulsive potentials but is entirely misleading for the attractive potentials 
under consideration. This may be seen as follows. 

From eq. (37) it follows that the Green function G(o,7) becomes positive for the 
small momentum transfer, since in the limit ¢, ->0, G(o, =) approaches — {2/u(00, 00) }~ 
which is positive. Now the energy denominator represents the energy difference between 
the ground and excited states, and the above argument indicates that the energy value 
for the state in which two particles are excited is smaller than that of the ground state 
in which n,=N and n,=0 (k#0), provided that the momenta of excited particles are 
small. Thus, the latter state is no longer stable and does not represent the true ground 
state. One of the way to find this will be the variational procedure in which the energy 
given by eq. (33) (with e’s replaced by u’s) becomes minimum with respect to the 
variation of the occupation numbers. After the occupation numbers are determined, the 
usual procedure may be applied to the determination of excitation energy. We do not 
enter into this problem here, but we wish only to remark that the variation procedure in 
the configurational space has already been employed by various workers". 
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A simpler method of deriving the Clebsch-Gordan coefficients is described. By the use of the 
new operators for angular momentum, introduced in a recent paper, the problem, which properly be- 
longs to the domain of Algebra, is tackled here by the more convenient methods of Analysis. The 


natural occurrence of the hypergeometric function is an interesting feature of this treatment. 


$1. Introduction 


In a previous paper” (referred to hereafter as (I)) the one-variable operators 
M,+iM,=e** (j+iD,), M.=—iD,, D,=d/dp (1) 


were shown to give a representation of angular momentum both for integral and half- 
integral values of j*, and it was correctly realised that their use might lead to an easier 
method of calculating the Clebsch-Gordan (briefly C-G) coefficients. An important step 
in this direction was taken by setting up a second order differential equation (eq. (4) 
below), whose solution would yield a general expression for these coefficients. As the 
very form of this equation discouraged any attempt at a solution, a recursion formula con- 
necting three successive Fourier coefhicients (which are identical with the C-G coefficients 
of the second kind as defined in § 2) was set up. But this formula was as intractable 
as the equation itself, and the possibility of getting any simplification seemed to be re- 
mote. Recently, however, it has been realised that the difficulties are illusory, and that 
eq. (4) is, in fact, one of the standard equations of Analysis in an apparently unre- 
cognisable form. Furthermore, it has been possible to set up a system of 4j+2 coupled 
equations, in a certain sense, more satisfactory than eq. (4). These new equations, being 
of the first order, can be solved immediately, and the solution involves hypergeometric 
functions, which naturally occur in the theory as formulated in the present paper. The 
entire theory can, therefore, be worked out from the known properties of these functions, 
and a general expression for the coefficients obtained without using the recurrence relations 
of Racah” or the group-theoretical method of Wigner. With the help of the transform- 
ation formulae for hypergeometric functions this expression can, in fact, be written a variety 
of forms, of which only one will be given here. The reduction to the hypergeometric 


equation places a powerful tool at our disposal and may be of help in finding out interest- 
ing relationships. 


* The results of this paper are valid for half-integral quantum numbers as well. 
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§ 2. Clebsch-Gordan coefficients of the second kind 


If the normalized spherical harmonics Y” (6, ¢) are replaced by e’”? and the usual 


-angular momentum operators by the operators (1), then the matrices assume the forms 
(M,,+iM,) m+im=j/—m, (M.—iM,) mm+i=j+m+1, MM) mim 


where, M,, M,, M_ are the components of angular momentum in units of 6. These are con- 
‘nected with the usual matrices M,* + iM,*, M.* by a similarity transformation M= A-!M* A, 
where ek ; a diagonal matrix with diagonal elements 4,,,,=[(j—m)! Gm) Ife. if 


HAS Pe e’"* is the solution of the: Coes problem, then the actual solution in 


terms BE olen harmonics is F(#, ee vies ml 3 (0,9) (see § 8 of (I)). 


m=—j 


For a dymamical system with two angular momenta M, and M, the functions /,,, 
‘which simultaneously diagonalize M.=M_.,+ M., and the total angular momentum 
M’= (M,+M,)° are linear combinations of products of the type Y7"(4,,0,) Y%?(o,@)). 
The coefficients {m,m,|jm} of these linear combinations are called the C-G coefficients. 
It will be convenient for us to call them “ coefficients of the first kind.” If the normaliz- 
ed eigenfunction Y’,,, of the operator M* is multiplied by |(j—m) ! (j+m)!]'? and the 
replacements stated at the beginning of this section are made, we get a function 


D jm = > (M,My| jm) exp (1,2, +im,Q.) . (2) 


™1.™m2 


‘The coefficients (m,m,|jm) of this double Fourier series will be called ‘“‘C-G coefficients 
of the second kind.” Evidently the connection between the two kinds of coefficients is 


m,m,| jm} = (m,m,| jm hiss m,)! Gitm,)! Ci2—m,)! Cot my)! e 
{mmm} = (om) as { 


The introduction of the coefficients of the second kind is essential for the discussions to 


follow and results in considerable simplification of the mathematical treatment. 


§ 3. The differential equations satisfied by %,,, 


We now proceed to set up certain differential equations satisfied by the function Dim 5 


which is an unnormalized eigenfunction of the operator 
M?= (M,)?+ (M,)?+ (M,,+iM,,) (M..—iM,p) 
+ (M,,—iM,,) (Mig+iM,) +2M..M.. 
=F, +F,+ e*-?) (j,+4D,) (jo—iDy) 
+ ej, — 4D) (jo+4D2) — 2D,Dy 
belonging to the eigenvalue F, where, 


F=jGt), h=iitDohehlat), Dima, Dm 


Introducing the variables, s=%,, 7=¢1—Y2, we have 
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M,+iM,=e**[j,+iD, +iD, +eF" (j. ¥ iD,) | Me-iDS 
MP=F,+F)+e!"(j,+iD,+iD,) (je-+iD,) +e“(j,—iD,—iDy) (jo iD) 
it 2 Desp Da Digs 


The differential equations are conveniently set up with the help of the commutation 
relations, [M,, M?]=[M,+iM,, M?]=0. In combination with the equation 


[M2— F0,,,=0 (3) 


they determine the functions %,,, except for an arbitrary factor, which may depend on 
ji» J» j and sometimes on,m. The relation [M., M?|=— i D,, M*|=0 implies that m=m,+m, 
is constant for a particular function %,,,. The double summation over ™, and m, in the 


defining equation (2), therefore, reduces to a simple sum, and we have 


CRS SS it aig for feo ae ge 
m2 


As a consequence, the variable ¢ drops out altogether from eq. (3), which reduces to 
eG Dy fa te) +e-™(j, +m—iD,) (j.—iD,) 
+2 (m+ D,)D,+F,+ F.— Flt =0.- (4) 


This important equation already occurs in (I). It determines the functions 7,,,, and 
therefore C—G coefficients, up to an arbitrary factor involving j,, j., j, ™. 

Let us now examine the consequences of the pair of relations |M,+iM,, M*|=0, 
which imply that (M,+iM,)@,,, is a linear combination of the 2j/+1 eigenfunctions @,,. 
for different values of m. The occurrence of the factor e** in M,+M, further ensures 
that this linear combination consists of a single term @,,,.;. On suitably adjusting the 


arbitrary constant in @,,, and dropping the variable ©, we have 
Lii—m+iD, +7" (jp—iDy) \Zim= (J=—™) Lim 1 (Sa) 
| jrtm+ 1—iD, +e" (j,+iD,) Lim Dace G+m+ 1) Lim : (Sb) 


It is easily seen that they lead to eq. (4) and determine the functions Ym up to an 
arbitrary factor independent of m. This factor can be determined by normalizing the 
simplest of the functions ‘/’;,,, namely, ¥, or WV, _,. 

As it will not be necessary to consider more than one value of j at a time, it is 
better, at this stage, to drop the subscript j, and write Am for Yim, Pp for Din, etc. 
We notice that eq. (4) remains unchanged on changing the sign of m and passing on 
to the complex conjugate. An immediate consequence of this symmetry property is that 
1xm=Cmifm- Making the same changes in eq. (5a) and comparing with eq. (5b), we 
see that C,,=C,,_,;=C is independent of m. From %o* =CyY it then follows that|C|=1. 
The exact value of C can be obtained by comparing 7; and 7_; given in eq. (9) below, 
and turns out out to be C=(—1)*#-)_ Therefore, 


Nance ol) e aye (—m,—m,|j, —m) = (mm, | jm) (— 1) #*8-5 (6) 
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To bring eqs. (4), (5a), (5b) into more convenient forms and to show their con- 
hection with the hypergeometric equation we put x=e"". This gives 


[——(i—m-+xD) (jo+xD) +x (j,+-m—xD) (jo.—xD) +2(m—xD)xD 


4 °F HE Fly 0, Dee (7) 
ive (Ys) =[x(1—x) D+j,—m+ jx], = (j—m) Am+1 > (8a) 
Ln) —x) D+ f+ m+ 1+ jo/x)%mi1= vee 1) Ym (8b) 


The eqs. (8a), (8b), being of the first order, can be solved easily. Putting m=j in 
(8a) and m=—j—1 in (8b), we have 


Y= "(1 —x) viva ; oe x) a a a (9) 
Next, put 


Ym = XRA™ (Lx) BB-3 Uy = KP (L—x) ARS gy (10) 
This gives the following two alternative forms of the set of equations (8a), (8b) : 
[A —x)D+j—m}um= Gm) tm st 5 
[x(1—x) D—j, + jotmt (jo tf) xWm= GH™) tenn 
[x(1—x) D+j,—jo—mt+ (— jt ptf) *]im= (G—m) Umit » 
[Q—x) D+j+m}o,= Gm) Um-1- 


(11) 


(12) 


A comparison with the relations between contiguous hypergeometric functions, namely, 
(xD-+ a) F(a, 6; c; x) =aF (a+1,-b; c3 x) , 
[x(1—x) D-+c—a— bx] F (a, b; c; x) = (c—a) F(a—1, 8; c; x) 
now shows that the solution of the set (11) is 
tin =F (— jm, —jrtj— js —273 1-9) 
and the solution of the other set (12) is 
Vega l (=f Mgr fees 27s 1—x). 
We can get two other forms of the solution by making use of the relation (6). These 


are 
Gx) Ort iy erat F(a) aan, Weajere = 2ii* (x— 1) /x) 


mac httim F(—j—m, jth js —2)s (1/9. 
The four different froms can be obtained very simply from Kummet’s connection formulae. 
Other forms may be obtained, for instance, from the relation 
F (a, b; ¢; x) =['(c) I’ (c—a—6) / {LP (e—4) ['(c—6b)}. F(a, 6; a+b—c+1; 1—x) 


which is valid when at least one of the parameters a, b is a non-positive integer, and the 
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denominator in the successive terms of the series on either side of the identity does not 
vanish earlier than the numerator. The factor involving Gamma functions can often be 
evaluated by a limiting process even if it is meaningless. Each new form of the solution 
leads to a different expression for the C-G coefficients. But it is usually difficult to estab- 
lish the equivalence of the various expressions by direct algebraic reduction. 

The foregoing discussions make it clear that eq. (4) or (7) is a hepergeometric 
equation in disguise, and can be reduced to it by either of the substitutions (10). Eli- 


mination of 7» Of %m+i from eqs. (8a), (8b) leads to the equations 
[ist' Ee — (j—m) (j+m4+1) ]yn=0, and (L,""L3— (j+m) (j-m+1)]y.=0 


and these must be identical with eq. (7). The entire set of coupled first order equations 
can, therefore, be replaced by the single second order equation, which may be taken to 
be the basic equation of the problem. 


§ 4. Determination of the constant 4, 


As @,, has a definite connection with the normalized function ¥,, the multiplying 
constant in 7,, cannot be arbitrary, but must have a definite dependence on j,, js, j. To 


use Racah’s notation we write 
t= 4, (2)Y Gite" Aa) 8 F(t m, —jytjej3 —2j3 1-2) - 


The constant A; is most easily determined by normalizing the functions ¥, or ¥_,, that 
is, from either of the relations 


>} {m,mg| jj}? =>} {mym,|j, —j}?#=1. (13) 
ma 


m2 


The C-G coefficient (m,m, 


jij) is the coefhcient of x™ in the expansion of 7, : 
(mymg| jj) = 4,(2j)!(—1)*—-™/[ (Gj, —m,)! Gog) !]. (14) 
The coefficient of the first kind is, therefore, 
{mma jj} =A, (— 1) (2) 1Cjy tm)" Geem)!/ { Gi — mm) Ga— me) IEF. 


The relation (13) now gives*, on writing ¢ for j,—m,, 


; ACatp—j—9! (atj—j)! 
X FC fHp hpi lS — 27,3 ay 
Gauss’s formula, F(a, 6; ¢; 1) =/"(c) ’(e—a—b) / {'(c—a) I'(c—6)}, then gives 


Atay CINCOM jitptit9! _— @)QiIM=fjtieti! y 


A= Q2j+V) Gate POG RAD HAtht)!Gitptj+y 


This method of evaluation of A, is perhaps simpler. 


ey : : 
. Flere, as in all formulae of this paper, the summation index takes all integral values consistent with 
the factorial notation, the factorial of a negative number being meaningless 


~ 
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§5. General formulae for the coefficients 


From the various expressions for 7,, it is easy to obtain general formulae for the 
C-G coefficients. In the second form they are identical with the coefficient of x” in 


the expansion of Am: To get a formula for the coefficients we select, at random, any 
particular from of the solution, say, 


Ym = A, (2))!/ jst jo—j) x (1a) IF (= jm, jy—jy—j 3 —2) 3 1-2) 
ee, As(—fatptyp!Gj+m)! 


rr x 
Cit+fe—j)! 
A Mastic: 2j—t) int fej +! 
Soe ore aera Ga ne a 
oe A j+m—)\(—j,+ptj—O's\(itp—jtt—9! 
whence, 
See Mh Ce ! 
(rma) = Dy! 
Cat p—p'Getm,)! 
SS cia NEP irae as ena (15) 
Z AC j+m—t)!(—j,+p+j—9!G,—j—m+h!. 


The correctness of this formula can be tested by seeing if it satisfies the two recurrence 


relations, 


(j—m) (mm, +1|j m+1) = (m,—1 m,+1|jm) (j,—m, +1) + (mm, 


jm) (jo— My) 
jm) (jo+m), 


which follow from eqs. (5a), (5b), and are substantially the same as Racah’s relations” 


(j+m) (mm, — 1|j m—1) = (m, +1 m,— 1|jm) (jr, + 1) + (mm, 


(3) and (5). A simple calculation shows that it satisfies the second of the two tela- 
tions. Instead of trying to verify that it satisfies the other relation also it is much 
easier to show that it gives the correct initial conditions, that is, the expression (14) for 
(m,m,|jj). That it does so is immediately seen by writing the expression, obtained from 
(15), in the from 

A,(—}j, +jot+y))!(2j)! Ss 1 Jatmott aa Se Gee yo jot) baat de 
ieee es CU a ot share haat vee I OE es 

Citse—j)! Germ)! AC ftptj—)!Gi-j-mt?)! 

= A, pis hai: paca @ —x) a iasashestei x=1. 

Git je—p)! Ger me)! 


(m,m,| jj) = 
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Electronic Structure of the Exciton in Ionic Crystal 


Further Discussion of a Localized Exciton (Zero-th Order Description) and 
General Formulation of a Moving Exciton in Alkali Halides 


Toshinosuke MUTO, Sei’ichi OYAMA and Hiroshi OKUNO 
The Institute for Solid State Physics, University of Tokyo, Tokyo 
(Received July 24, 1958) 


After a careful discussion of the characteristic features of the foregoing theories of an exciton in 
ionic crystal, a new approach to the exciton problem has been proposed, in which the one electron 
orbital is shown to yield already some of the characteristic properties of an exciton, an @-exciton (an 
exciton localized in the immediate vicinity of a negative ion vacancy) and an X-ray exciton in com- 
parison with the experiments performed so far. In view of the present one electron orbital yielding 
a satisfactory zero-th order description of an exciton, a general formulation of a moving exciton based 
on the present one electron orbital is developed in connection with the alkali halides and an ap- 
proximate estimate of a singlet exciton peak in KCI has been carried out, getting agreement with the 
experiment as far as our approximate evaluation of the various quantities involved is permitted. A 
brief discussion of the effect of spin-orbit coupling on exciton multiplet has been made and a possible 
interpretation is presented on the splitting into doublet of the exciton peaks in KCI and NaCl below 
150°K as recently observed in the reflection method of measurement by Hartman, Nelson and Sieg- 
fried. Finally, the comparison of Slater and Shockley’s theory of an exciton with the present theory 
is discussed in some detail and the latter is concluded to be more suitable for a small size exciton. 


S 1. Introduction 


As is well known,” two distinctly different treatments for the electronic structure of 
an exciton have been formulated, each with its range of applicability. An atomic orbital 
approach along the line first proposed by Frenkel and recently extended by Overhauser” 
and others gives the solution for the lower exciton states. This treatment involves formu- 
lating the exciton wave function from atomic orbitals of the atoms at crystal lattice sites. 
The original Frenkel proposal considered linear combinations of one electron orbitals re- 
presenting individual atoms in the crystal being excited. The formation of one electron 
orbitals representing an electron excited to an atomic orbital on a nearest neighbour atom 
(an electron and a hole on nearest neighbours) hes been discussed most recently by 
Overhauser.” The one electron description of the latter atomic orbital approach has been 
dealt with already by von Hippel” along the classical consideration. On the other hand, 
an alternative treatment based on the band theory for the exciton structure was first given 
by Slater and Shockley.” Their proposal for an exciton can be most clearly pictured as 
an electron in a conduction band being bound to a positive hole in a valence band owing 
to their electrostatic attraction, which eventually leads to the effective mass approximation 
for the exciton problem, first given by Wannier and further discussed by Meyer,” Luttinger 


ety 
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and Kohn,” and Slater and Koster.» Recently, the extension of the effective mass ap- 
proximation to the degenerate energy band edge has been discussed in detail by Dresselhaus” 
in connection with the exciton problem. The various developments in the exciton-theory, 
worked so far, may be considered to belong essentially to one or the other of the men- 
tioned alternatives in the main features. 

The common characteristic features of the above two methods of treatments for the 
exciton problem will be described in the following way. At first, the appropriate one 
electron orbitals are constructed in which an electron is moving in the specified electro- 
static field, and then Slater’s determinantal functions are set up corresponding to each 
electron configuration. The configuration interaction is worked in a suitable way, taking 
account of the effective electron configurations. The exciton characteristics in such methods 
of procedure do not make their appearance in the zero-order description but only in the 
higher order approximation through working the configuration interaction. In fact, the 
influence of the surrounding ions of the crystal on an excited electron has been entirely 
disregarded in the zero-order description of Frenkel’s atomic orbital approach,” whereas, in 
the band theoretic approach of Slater and Shockley,” the effect of an excess positive 
charge is not carefully taken into account in the one electron orbitals, the field acting 
on an excited electron being perfectly periodic in the same way as an additive electron in 


ionic crystal. 


Now, when we remember the fact that the self-consistent procedure of constructing 
the Hartree field of a crystal electron, as in the atomic structure, can hardly proceed 
mainly due to the mathematical difficulty but, instead, the periodic field assumption is 


usually adopted from the outset,’ 


it seems to be worthwhile to investigate carefully the 
best one electron potential in the crystal in connection with the exciton problem. In 
fact, the crude consideration seems to show that, in the case of metals, an excess positive 
charge left after the removal of a concerned electron may be considered to be smeared 
out rapidly over the whole crystal owing to the screening of the remaining electrons of 
high mobility so that the resulting electrostatic field on a concerned electron will become 
sufficiently periodic in space, whereas, in the case of ionic crystals, the mentioned positive 
charge may be reasonably supposed to be more or less localized around an ion or a group 
of ions of the crystal since the electrons of ionic crystal may be regarded to be relatively 
strongly bound around the individual ions in the crystal. The Hartree field in ionic 
crystal, therefore, will probably be considered to become non-periodic in the neighbourhood 
of the mentioned ion or the group of ions. Thus, we shall have to deal with the 
perturbed periodic field for treating an electron in ionic crystal in a sense of Hartree ap- 
proximation, in contrast to the periodic potential field in metallic crystal. 

Our new approach to the exciton problem is to adopt the one electron orbital in 
such perturbed periodic field as a starting point of the conventional method of treatment 
of many electron system in ionic crystal. In fact, a suitably chosen perturbed periodic 
field for potassium chloride has been shown, in Part I,’ to give rise to a number of 
localized states just below the conduction band, which have actually some of the exciton 


? oe, 12) 
characteristics observed so far and are found to correspond to Dexter’s excitation model. 
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The comparison with experiment seems to show good agreement under a fewer assump- 
tions compared with the case of von Hippel’s model.” In order to extend the validity 
of our method of procedure for working the zero-order description of an exciton in alkali 
halides, the similar calculations to the previous one are developed to all alkali chlorides 
and, further, to an exciton localized around a negative ion vacancy in alkali chlorides 
which corresponds to one of the Greek bands observed by Pringsheim and others’ in 
alkali iodides and bromides; their results shall be described in the next chapter together 
with some remarks on the X-ray exciton in KCl.’ As a next higher approximation, we 
shall deal with a moving exciton by taking into account the configuration interaction 
based upon the one electron orbital mentioned above. The actual procedure of treating such 
moving exciton will be carried through in a straightforward way and leads to the character- 
istic secular equation from which the exciton multiplet can be deduced, at least in principle, 
in view of the branches of a valence band and the spin-orbit coupling. The result seems 
to roughly correspond to some refinement of the pioneer work of Frenkel and will be 
reasonable particularly for the small size exciton or the lower exciton states in the crystal 
of relatively small dielectric constant. 

In this connection, it should be mentioned that the exciton problem seems to be 
worthwhile paying attention for the following reasons : first, it can provide us very important 
features of crystal electrons which can not be satisfactorily dealt with by the simple 
band theory which has been fruitful for solid state physics, and secondly it is regarded as 
one of the typical problems of the correlation effect of crystal electrons which can be 
manipulated mathematically through taking into account the configuration interaction, 
although a strong correlation in large scale of crystal electrons is shown to be responsible 
for the collective oscillation of plasma type as shown in Bohm and Pines’ theory.'” 


§ 2. Leealized exciton in alkali halides 


(Zero-order description of an exciton) 


a) Optical exciton in alkali chlorides.* 


According to the consideration described in Introduction, our method of procedure 
in Part I’) has been extended to all alkali chlorides which have actually been observed 
experimentally. Our main task is to solve the eigenvalue problem given by Wannier- 
Slater’s equation valid for alkali chlorides which involves, as parameters, effective mass m* 
near a conduction band bottom and effective dielectric constant « responsible for the 
polarization of surrounding medium. Our numerical results are shown in Table I in which 
the observed values are quoted from Hilsch and Pohl, and Schneider and O'Bryan.” 
—E’ represents the energy difference between a ground state of an exciton and a conduc- 
tion band bottom. According to the recent experiment by Hartman et al.,! however, 


* The brief account of the present section has been re i 
ported in Short Notes of J. Phys. 
12 (1957), 108. eine 
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the absorption peak corresponding to a series limit of exciton states in NaCl seems to 
vanish, which fact requires further careful considerations from both experimental and 
theoretical sides. 


Table I 
— Evys/ 
Licl | 2.75 0.6 Gao ey | 0.86 ev 
NaCl | 2.25 1.0 1.81 1.83 
KCl | ais 0.85 1.83 1.80 
RbCI | 2.19 0.7 1.53 1.50 
CsCl | 2.60 0.9 13) 1.39 


At first sight, it might be argued that a variety of the possible values for two 
parameters m* and « may happen to give equally good fit to the experimental results. 
However, a general consideration, both experimental and theoretical, requires for « to lie 
in the range of one to high frequency values and for m* to be of the order of magnitude 
ranging from 0.5m to m, m being the true mass of an electron, which situation enables 
one to get almost unique assignment of these quantities in our method of procedure. 


'D in the case of KCI, the effective dielectric constants 


According to the previous experience 
« have been assumed to be high frequency values in our results shown in Table I. The 
assigned values of m* are observed to be of the order of magnitude to be expected in 
accordance with the available other experimental evidences. Generally speaking, the com- 
parison shown in the table seems to be good for the zero-order description of an exciton 
as far as the approximation in our method of procedure is allowed for. 

b) X-ray exciton in KCl. 

In Part I we have worked the numerical estimates of the location of a ground state 
of X-ray exciton from the bottom of a conduction continuum in KCl in comparison with 


) 


those of Parratt and Jossem’s observation.'” The results were as follows. 


| Our estimate Parratt-Jossem’s one 
Kt | 3.39 ev 3.2 ev 
Clix | 2.5 ev 4.13 ev ‘ 


Our estimates were obtained by tentatively assuming 0.8 m for the effective mass near 
the conduction continuum bottom and 1.5 for the effective dielectric constant in view of 
the short period of X-rays. The discrepancy seems to appear in the case of Cl”. How- 
ever, since the bottom position of a conduction continuum is not entirely recognizable in 
the observed contour for either chlorine or potassium ions of KCl, Parratt and Jossem 
seem to have actually made use of the following empirical relation of the width of a 
valence band (4E,) with the location of an X-ray exciton peak due to Cl” ion (A peak 
in their contour) from the bottom of a conduction continuum (|E|’), which relation is 
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established through the careful analysis of both K, emission and K absorption spectra in 
Cl ion of KCI together with its ultraviolet absorption spectra. 


9.46—|E’|+4E,=9.75 (in ev), 


in which 9.46 eV represents the energy gap between the conduction and valence bands, 
estimated from the second peak of the ultraviolet absorption and 9.75 eV the energy 
distance from the valence band bottom to a X-ray exciton around Cl ion (A peak), 
estimated from K, emission line and A peak of X-ray absorption in Cl-. If the above 
empirical relation were accepted, we would have very interesting checking for the location 
of the conduction continuum bottom, which depends on the width of a Cl 3p valence 
band in KCl through the above relation. In fact, Parratt and Jossem seem to have sup- 
posed for the width of a Cl 3p band to be 4.42 eV, probably suggested by Shockley’s 
computation of the energy band width in NaCl, which resulted into 4.13 eV for the 
location of a conduction band bottom from the exciton peak, as mentioned above. On 
the other hand, the substitution of our estimate of |E’|, 2.5 eV, in the above relation 
leads to the width of valence band of 2.79 eV which is narrower than the above men- 
tioned one. When, as suggested by Parratt and Jossem,'® the polarization around a 
chlorine ion lacking a 1s electron is taken to become larger compared with that around 
a potassium ion lacking a 1s electron, owing to the larger volume of the former, and 
provisionally we take «=2.0 and m*=0.8 m for the former, the interpolation from Table 
I of Part I leads to the X-ray exciton state at about 1.7 eV below the bottom of a con- 
duction continuum and, consequently, to the valence band width of about 2 eV in KCI by 
using the empirical relation. On the other hand, a recent theoretical calculation by Howland™ 
of the energy bands in KCI by the use of the Bloch sum method indicates increasing 
favour for a narrow valence band of KCl, i.e., 1.52 eV for the width. In view of the 
above situation, our estimate for the location of X-ray exciton relative to the continuum 
bottom seems to be on a right line, though qualitatively, as far as the zero-order descrip- 
tion of an exciton is concerned. 

c) An exciton localized in the immediate vicinity of a negative ion vacancy in alkali 
chlorides (a exciton) .* 

Recently, Bassani and Inchauspe™ have calculated the energies to create an exciton 
localized in the immediate vicinity of a negative ion vacancy or an F-center by a similar 
cycle method as was adopted in the semi-classical treatment of von Hippel. In fact, the 
appearance of the new absorption bands in the tail part of the fundamental absorption 
when a large number of halogen vacancies or F-centers are present in the crystal seems 
to be ascribed to such localized excitons. In view of the above situation, our method of 
procedure mentioned above has been applied to such case. For an exciton trapped in the 
immediate vicinity of a negative ion vacancy, the potential field in the Wannier-Slater 
equation is taken to be the negative of the Hartree field”? due to the nearest neighbouring 


* The brief account of this section was read at the annual meeting of Physical Society of Japan, 
October 17, 1957, 


— tt 


Electronic Structure of the Exciton in Tonic Crystal 809 


Cl ion, i.e., V..(r—mna) in addition to the potential field of a normal exciton which 
has been worked out in Part I, i.e., V(r). Furthermore, the former potential is assumed 
to be reduced by a static dielectric constant of the crystal, while, the latter one by an 
effective dielectric constant as was done in Part I. The resulting potential is easily seen 
to extend toward a neighbouring negative ion vacancy, as described in Fig. 1. 


Fig. 1 


(ev) 


Vir) + Vig (r—na) 


—8 
(ev) 


The Wannier-Slater equation for the present case becomes 


Fagin) + (V0) +¥ ar —na)} (7) SE" Y(r), _ 


2m* 


whose solution has been obtained, for the ground state, by the variational principle. The 
trial function is assumed to be given by a superposition of a p-type function on an s- 
type one, taking into account the fact that the attractive field extends toward a negative 
ion vacancy. The obtained numerical results are shown, for alkali chlorides, in Table IJ, 
together with the results of Bassani and Inchauspe (BI) and the observed values (4E.x,)- 
E’ (a-exciton) and E’ (free exciton) represent the energies of an a@-exciton and a normal 
exciton from the conduction band bottom, and JE’=|E’ (a-exciton) —E’ (free exciton) |. 
The comparison seems to be rather good, allowing for the approximate nature of our 
method of procedure. Unfortunately, however, Greek bands in alkali chlorides have not 
yet been reported, probably because they lie in the vacuum ultra-violet. Finally, it should 
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Table II 

if E’ (a-exciton) | E’ (free exciton) AE’ BI AE exp 
LiCl ; _1,19 ev | —0.89 ev 0.30 ev — — 
NaCl 239 | test 0.58 0.50 ev = 
KCl Py, | =(s3 0.54 0.53 = 
RbCI —2.09 | 21353 0.56 = — 
CsCl —1.76 —1.33 0.23 - -- 
KBr = | it = 0.49 0.59 
KI _ | — — 0.44 0.59 


Se ee ee ee Ee ee 


be mentioned that the rather rapidly varying region of the potential in (1) has been 
found to be less sensitive to the energy determination and then the approximate validity 
of the Wannier-Slater equation seems to hold. 

Summing up the results obtained so far, the one electron orbital in a suitably con- 
structed, perturbed periodic field may be considered to afford a good zero-order description 
of an exciton in alkali halides, with fewer assumptions and much smaller corrections than 
the foregoing models. Furthermore, according to the recent discussion by Dexter,” the 
oscillator strength and charge distribution of the present model have been shown to become 
a satisfactory zero-order description of an exciton. In the present calculations, however, 
our main concern lies in the electronic structure of an exciton in alkali chlorides in which 
a chlorine ion has relatively small spin-orbit interaction. In order to work the exciton in 
alkali bromides or iodides which show sizable fine structures, we shall have to take into 
account the spin-orbit interaction and spin-spin interaction between an electron and a hole 
within a localized exciton, by introducing them appropriately into our Wannier-Slater 
equation. However, such higher order interactions should be taken into consideration in 


more reasonable way in a many electron treatment of the following chapter. 


: § 3. A moving exciton 
a) General consideration.* 


As stressed in the previous chapter, a localized exciton discussed above is regarded 
as the zero-order description of an exciton. However, a localized exciton state in unit 
cell is energetically equivalent to that in any other unit cell as a result of translation 
symmetry of the crystal lattice, so that such a state of excitation will propagate through 
the crystal. Next we shall proceed to work a mathematical formulation of such a moving 
exciton in the crystal on the basis of the one electron orbital described above. 

In the following computation of a moving exciton, therefore, the wave function of 
the localized exciton state just below the conduction band is adopted for one electron 
orbitals of the excited states. The normal states of valence electrons are supposed to be 


* The brief account of this section has been reported as Short Notes in J. Phys. Soc. Japan, 12 
(1957), 108, 
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approximately described by the Bloch orbitals of the simple band theory since the majority 
of valence electrons of the crystal is expected to lie in the valence continuum which 
behaves quite similarly to the valence band of simple band theory, and a few of them 
is brought into the localized states just below the valence continuum, as revealed in the 
K, emission line’? in KCl. Furthermore, the inner shell electrons within each ion of the 
crystal will be amalgamated into the atomic cores whose interaction with valence electrons 
is represented by a suitable potential field V,,(r). 

Then, Schroedinger wave equation of N valence electrons in the crystal may be 
written as 


29 N “Ww N 2 
He=|—" S144 SW) + 3) | PHED, (2) 
2m i=1 t=1 i>j 154 


where i specifies each of the valence electrons, 7,;=|r;—r,| and the other quantities have 
the usual meanings. 

As for the available electron configurations, we shall confine ourselves to the configu- 
rations with one electron excited to a ground state of the localized exciton below the 
conduction band from any one level of the valence band. Different configurations will 
be specified by different hole states of the valence band and by different locations of the 
localized exciton. The other possible configurations in which an electron is raised into 
the various excited states of a localized exciton and the conduction continuum, and, further, 
two or more electrons are raised into the similar excited states, will be entirely disregarded 
mainly for the sake of mathematical simplicity. _ However, the generalization of the 
present formalism by allowing for the neglected configurations will be easily carried out 
and will lead to the secular equation of the higher degree. The mentioned, approximate 
procedure of a single exciton will be safely applicable to the lowest exciton bands as far 
as the concentration of existing excitons is so low that their mutual interactions need not 
be taken into account. Throughout the remainder of the discussion we shall refer to 
KCI, regarding it as a prototype of the other alkali halides. 


As actually shown in Part I,'” the one electron orbital of a localized exciton around 


R, becomes 


c(L; r) = S)U(R,—R,) -4.(r—R,) G3) 


which satishes the Schroedinger equation of the perturbed periodic lattice, 


ie a ALV,!(r) +4V(r—R)| -c(l; rye saatl -7T) (4) 
2m 


and the Bloch orbital of the n-th branch of a valence band is given by 


ballon tT) = GP peor a, (i Fe (5) 
which satisfies 
{= s-+¥,"(r)} by (les 7) =En Ch) bu (59) (6) 
m 
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since a Cl 3p band, i.e., valence band has been shown to split into three branches by 
Bethe” and Howland’ using the Bloch sum method, each branch et the valence bac 
being designated by a sufhx n. V,/(r) expresses the periodic potential = an additive 
electron in KCl and JV(r—R,) the potential due to a positive hole localized around a 
lattice point R,, which has been actually worked out in Part La, ray and a,(r 
—R,) represent Wannier functions around R,, belonging to a conduction rap and the 
n-th branch of a valence band, respectively. U(R,—R,) has been shown, in Part I, to 
become of exp(—Z|R,—R,|/a,,) for the ground state of a localized exciton. rs 
more, E**° and E,(k) express the energies of a ground state of a localized exciton and 
of the n-th branch of a valence band, respectively, k being wave number vector. It 
should be mentioned that one electron orbital of a localized exciton (3) is orthogonal to 
the Bloch orbital of a valence band (5), as clearly shown by the orthogonality property 
of the corresponding Wannier functions. 

For the ground state of the crystal as a whole, the valence band is completely filled 
with electrons and a localized exciton state or a conduction continuum is entirely empty. 
The corresponding electron configuration corresponds to a singlet state, of which the deter- 
minantal functions in the Bloch orbital method and in the Wannier function method 


are identical with each other. 


The corresponding energies become 


E,= 2) (a(R), H(t) a, (RR) 


+3083 | (4.7) Ry) ay (r,—R,), e*/r,.4,(r,—R,)a,,(r.— R,) ) 


nR)>mk, 
re (a,(r,—R,) 4,,(r,-—R,), e r194,(r,—R,) a,, (r,—R,)) t > (7) 


in which n or m specifies each branch of a valence band as well as the corresponding 
spin state, and H(i) = —h°*/2m-4,4+V,(r,). 

For the electron configurations corresponding to the excited states, let us suppose 
that an electron is excited from a k, state of the n-th branch of a valence band to a 
localized exciton level around a lattice point R,. Then, according to the different spin 
orientations of both an excited electron and a positive hole, we can construct as usual 
the determinantal functions belonging to singlet and triplet states, which consist of the 
Bloch orbitals of a valence band and a localized exciton wave function. When we multiply 
each determinantal function mentioned above by exp(tk,-R,) and then sum up over ky, 
the resulting functions are shown, using Slater and Shockley’s procedure,” to be propor- 
tional to the determinantal functions built up from the Wannier functions themselves, 
belonging to a valence band, i.e., a,(r—R;,) and a localized exciton wave function, 
c(l, r). The latter determinantal functions contain an excited electron in a localized 
exciton state around #, and a positive hole in the state represented by a Wannier func- 
tion around R, of the n-th branch of a valence band. Thus, the determinantal functions 
corresponding to singlet and triplet states of the crystal as a whole may be written as 


—aAe 
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typeset) — I {Dini vi hyo, eink dae! ht 
Wee 


and 


Sareaae?) —_ 1 


7= 


{Dont viet ¥) _ Gent hs et} 
? 
V2 


Sa sel) = Dor +; Hon 


3 pr mi.et) = Qe AS as 


813 


(8) 


(9) 


(9)’ 
(9) 


respectively. @""¥i%¥) represents the determinantal function built up from the Wannier 


functions and a localized exciton wave function, by assuming an electron with down- 


spin to be removed from the state represented by a,(r—R,)(c) and to be present in 


a ground state with down-spin of a localized exciton, i.e., c(L; r) (oc). 


a(o) and 


(oc) represent spin functions and (nl |) specifies a hole state and (el |) an excited 


electron state, respectively. Its explicit expression becomes 


n 5 6 1 IN 
he%s = o,° P \4 (r,—R,) a(o,) “a, (r,1— R;) a(o5-1) 


xe(l 5 rp) 2 (Fp) Ay (pvt — Ry 41) & (p43) +d; (Py— Ry,) B (cy) 


(10) 


in which a,(r,—R,)8(o,) is replaced by c(U; r,) B(c,) in the ground state function. 


The other determinantal functions in (8) and (9) are similarly defined, corresponding 


to the various spin orientations of a positive hole in a state represented by the Wannier 


function of a valence band and an excited electron in a localized exciton state. 


Allowing for the configuration interaction, we can set up the following linear combi- 


nations of (8) and (9) in order to solve the Schroedinger wave equation (2). 


Pa Ay (POM, t= 1, 3 
nb 


(11) 


which takes into account the interaction among the various configurations specified by 


the different locations R, of a localized exciton and a positive hole, and by the different 


branches n of a valence band in which a positive hole is situated. 


Substitution of (11) in (2) leads, as usual, to the following secular equation. 


S ere. Hegre) a re ='E'A,, . 


nies 


(12) 


In view of the translation symmetry of the energy matrix elements in (12), we can 


put, as usual, 


t _— ptK Rj t 
‘Ayee i ih 


(13) 


in which K represents a translation vector in the reciprocal lattice or a wave number 


vector of a moving exciton. 


Then, it follows 
Sat ee Coe H é yr(ntt! el!) eK (Ry -R) — 14 | iB 
v/ 


nt 


(14) 
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which leads to a 33 secular determinant, 


det | a Gp. H a! Aigabiol | eK -Cty—Ki) a oe Fenul ie 
Uv 


(15) 


The diagonal elements of the energy matrix becomes, after some process of calcula- 
tions, 


pe 


(et) 


E@” = Cowen Higr) ae SV eh Cti—- FD (Spe. Hig) 
¢ 


= Es Ee E10) els (el, nl 


e?/ry\nl, el) + (BV,—OV,) 
+ SY &h-(hu-Bd/ — §,,-&,(Ry—R,) — Sp > 10, al! \2/rl7, nf) 


ED (Eni) 


— (r, nl! \e?/r.|nl, r)} +5 (el, nl’ \e?/r,o\nl, el’) — (el, nl’ \e*/r,.\el’, nl) ], (16) 


where 2 or 0 and + signs stand for singlet and triplet exciton states, respectively. 
ei) = Gs. > Each) ae 
k 


w= jrus r)c(l;r)dr, (Overlap-integral) , 
(el, nl|e”/,o|nl, el) = ac Sra" (r,— R,) ry ¢(L; ry) a,(r,—R,)dr, ‘dry, (17) 


el int Vey r3| nl vel" r= Jed r,)4,* (rs —R,) 2 /ry-c(U 3r,) 4, (7, —R,) dr, dro, 


etc. 


The summation with respect to r in (16) extends over all states including spin except 
for a,(r—R,)a(o) and, furthermore, we have 


OV = (el |V,—V,'— AV |el) + Se elle*/rya|r, el) — (1, elale*/rplela, r)} (18) 
r(enla 2 = 
and 


OV = (nl |V,—V,!\nl) + {(r, alle/r|7, nl) — (7, nlle?/rglnl, )}. (19) 


On the other hand, the off-diagonal elements of the energy matrix may be written 
11, 7F) 2 o/ 
Emm = — S) L(x, n'lle*/rg|r, nl) — (1, n'l|e*/r,,|nl, 7)} 
r(senila) bs ak 


2 2/ 
+ (el, n'lle?/ryo|nl, el) — (el, n'l\e®/r,glel, nl) 


ais Sel et (Ri Ri) aa Sv 3 IP 2 ao iy 9) 
7Te™ 5) | Sur 23,4 nil|e’/nalr, nl) — (7, nl |e*/rip|nl, 1) } 


z 3 ; 
+ 9 Cel, n'l’ |e? /ry9\nl, el!) — (el, n'l’|e®/r, | el’, nl) ]. (20) 
According to the valence band formula” obtained by the Bloch sum method, & (0) 
represents the sum of the atomic energy of a 3p electron within a chlorine ion and the 
Coulomb energy of the corresponding electron in the field of all negative and positive 
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ions in the crystal except for a chlorine ion to which the electron belongs. Such Coulomb 
energy will be approximated by Madelung’s potential Vy, at a lattice site of a chlorine 
ion. Then, we have 


E, (0) = PB ta oP) = UA ip Vy= = Wea Tas (21) 


where @=1.784 for NaCl type lattice and r, denotes the cation-anion distance. 

The first term on the right hand side of (18) represents the negative of the Coulomb 
and exchange energies of an electron within a localized exciton around R, in the charge 
distribution of all the remaining valence electrons, whereas, the second term of it expresses 
the similar potential energy computed by the Wannier functions belonging to a valence 
band. Similarly, the first and second terms on the right hand side of (19) represent 
the corresponding quantities of a positive hole in the n-th branch in the Hartree-Fock 
charge distribution of all the valence electrons. If the Coulomb and exchange energies 
in the second terms of (18) and (19) were evaluated by using the Hartree-Fock solu- 
tions instead of Wannier functions, we would have exactly 0V,,=0V,=0. Otherwise, 


OV, and 6V, will not necessarily vanish but will be expected to remain small in our 


approximation. Therefore, (0V,,—0V,) may be reasonably supposed to become still 
smaller, on account of which we shall neglect it approximately in the following discussion. 
Each term of the first line of (16) has fairly simple interpretation on the atomic view- 
point, of which the first term represents the ground state energy of N electrons, the 
second the one electron energy of a ground state of localized exciton, the third the energy 
of a positive hole in the 3p state of a chlorine ion within the crystal and the fourth 
the exchange energy between an excited electron and a positive hole around the same 
lattice site. The remaining expressions of (16) represent the interaction energies due to 
the electrons in the various states with different locations in the crystal. Further, in 
(20) are involved the Coulomb and exchange interaction energies among the states belong- 
ing to different branches of a valence band and to a localized exciton. 

Generally speaking, the 3X3 secular equation will have at most three roots for the 
energy of a moving exciton, corresponding to each value of K. Thus, the three branches 
of exciton band could be expected to appear as far as our approximate configuration 
interaction is valid. However, when we take into account the effect of spin-orbit coupling 
and, further, the contribution from the higher order configurations in which an electron 
is raised into the excited states of a localized exciton and a conduction continuum from 
any one state of the valence band, the corresponding secular equation is readily seen to 
become of the higher order, which results into an increasing number of splitting of the 
exciton band. 

In order to solve numerically the secular equation for the exciton band, we are 


actually required to work out the Wannier functions belonging to both valence and con- 


duction bands. One of the authors* has recently worked out the Wannier function of 


a valence band KCl, by making use of the Bloch orbital obtained by Howland.” His 


Wannier function of localized type may be written as 


* S. Oyama: Busseiron Kenkyu (1958) (in press). 
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a,(r) =0.8-4,(0) +0.086- (y,(100) +¢,(100) ) 
—0.016- (y,(010) +¢,(010) +¢,(001) +¢,(001)) 
+0.003- (,(110) +¢4,(110) +¢,(110) 
+¢4,(110) +4,(101) +4,(101) +4,(101) +¢,(101) ) 
—0.008- (g,(011) +¢,(011) +¢,(011) +¢,(011) ) 
— 0.089: (g,(111) +¢,(111) +¢,(111) +¢,(111) 
+¢9,(111) +¢,(111) +¢,(111) +¢,(111)) 
+0.054- (¢,(200) +4,(200) + 0.023 (4,(020) +¢4,(020) +¢,(002) +9,.(002) } 
+higher order terms involving d,, ¢., ¢, and ¢., (22) 


in which ¢,(/, m, n) represents a 3p atomic orbital of a chlorine ion at a lattice point 
(1, m,n), i. &, Qyp,(P—Rimn) and similarly g_(/, m, n), a 3p atomic orbital of a 
potassium ion at a lattice point (/, m,n). ,(r) and a,(r) are obtained from the 
above equation through a cyclic exchange of x, y and z. Thus, the approximate form 
of Wannier functions will be taken as 


a,(r) ~0.8-¢,(0) 
a,(r) ~0.8-d,(0) >. (23) 
a;(r) —0.8- d.(0) 


Now, the absorption coefficient in the exciton region can be readily computed using 
(11) and (13), with the following result. 


“_ yoo|S31A, \a,*(r) -grade(0 ; r) dr’, (24) 
7 n=1 J 
where 7 denotes the absorption coefficient, ¢ the light velocity, q the wave number vector 
of an incident light and the conservation laws of both energy and wave number vector 
are given by 


by='E(k=0)—E,, k=27q=0. (25) 


In view of the above situation, we shall discuss in more detail the exciton energy corre- 
sponding to K=0, which is responsible for the peak frequency in the case of neglecting 
the interaction with lattice vibrations. Using (23) for the Wannier functions of a 
valence band and assuming for those of a conduction band to be approximately of an 
S-type symmetry around each lattice point, we can easily see that the matrix elements of 
e*/ry> with respect to the Wannier functions belonging to different branches of a valence 
band and a localized exciton wave function with the same location are to vanish from 
the symmetry consideration. Furthermore, the summation over I or R, of the similar 
matrix elements with respect to the Wannier functions or exciton wave functions of the 


different locations is seen to tend to vanish owing to both cubic-symmetrical arrangement 


—eOelf 
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of ions in the crystal and the p-type character of (23), a pair of matrix elements sym- 
metrically located being canceled with each other. Generally speaking, it will be permis- 
sible, for the qualitative discussion, to disregard the minor contributions from the above 
mentioned quantities in (16) and (20). 

Then, it follows, with neglect of the small quantities involved, 


Een (ii 0) eee ee (Cal SP) +V x) ee (el, nl|e?/r,9| nl, el) 
— >) Sur En Ry — R,) + JE (26) 
UCzl) 
and 


alg carey) (27) 


which shows that the singlet exciton states corresponding to K=0 are nearly degenerate 
and the peak frequency does not split when we disregard the effect of spin-orbit coupling 
and the interaction with lattice vibrations. The triplet exciton state lies lower than the 
singlet one, by an amount of twice the exchange energy of an excited electron and a 
positive hole. As far as the main terms (the second and the third ones) in the energy 
expression (26) is concerned, the transition to an exciton state (E“” (k=0) —E,) may 
be understood approximately as a transition of an electron from a Cl 3p-state in the 
crystal to a localized exciton state, which situation seems to roughly justify the approxi- 
mate validity of the zero-order description of our exciton model. 

For the numerical estimate, the explicit form of the Wannier function of a conduc- 
tion band of KCl, i.e. a,(r—R,) will be required but, unfortunately, we have no reliable 
knowledge of it at present. However, when we roughly assume a 45 atomic orbital of 
Slater type in Cl ion for a,(r—R,), the exchange energy in (26) is found to become 
extremely small, whereas the Wannier function evaluated by a single OPW of a conduc-- 
tion band yields about 0.5~0.8 eV for the mentioned exchange energy. Presumably, * 
its correct estimate will be supposed to lie near 0.1 eV or so. Furthermore, E °°, estimat- 
ed from our previous result on KCI and the half-empirical estimate of a conduction band 
bottom, becomes of —0.5 eV—1.86 eV= —2.36 eV. The values of &,(R,,—MR,)  esti- 
mated from the band width calculated by Howland" and Sj, from the previous work 
on KCI lead to approximately 0.75 eV for >'S,,&,(R,,—R,) and finally we get &, (0)! 
= Entom 3p) +V¥u=— 11.3 eV. It is quite difficult to estimate 4E on a simple way as 
above although it is reasonably expected to yield a minor contribution. Substituting the 


above estimates of the various quantities in (26), we finally obtain 


hy=E” (K=0) —E,~8.3 eV+ dE (28) 


for a singlet exciton peak of Rooirdtr a0" iy atte 
According to the reflection measurement of Hartman, Nelson and Siegfried,'” the 


peak frequency of an exciton absorption in KCI has been observed to become 7.7 eV at 


*% This is the case because of the polarization effect of the crystal in addition to the improvement of 


the Wannier function. 
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‘T=320°K and then the peak frequency has been found to shift toward the shorter 
wave length side with decreasing temperatures. Therefore, the above estimate of peak 
frequency of KC] at T=0°K may be considered to be reasonable, allowing for the crude 
estimates of the various quantities and the contribution from JE. The more accurate 
evaluation of the exciton peak shall be reserved until the Wannier functions belonging 
‘to the conduction band can be obtained more accurately. 

b) Comparison of the present theory with that of Slater and Shockley”. 

Finally, some remarks should be added to the comparison of the present results 
obtained above with those of Slater and Shockley. In the present method of procedure 
to treat an exciton, the Coulomb interaction of an excited electron with a positive hole 
around a halogen ion has been dealt with in the zero-order approximation of the formalism, 
whereas, in the theory of Slater and Shockley, the mentioned interaction has been worked 
in the next higher approximation. The above situation shows that the former method 
of treatment is more appropriate for a small size exciton and the latter one, for a large 
23) 


of type n and wave number 


size exciton, respectively. In fact, the exciton wave function 
wector K may be written, in Slater-Shockley’s theory, as 
pet pa PRR iit B(k.j; kj’) (29) 
ee ae, 
in which B(k,j; k,j’) represents the determinantal function corresponding to a configura- 
tion with an electron with k, in a conduction band j and with a positive hole with ky, 
in valence band j’. The states involved are restricted by the relation k,+k,=K. The 


Fourier transform of Yji'n, 5, may be written 
(N2) pel aad Pm Cet (30) 


where R is the radius vector of a center of mass of an electron-hole pair, r the relative 
position vector of the pair, 2 the volume of a unit cell and N the number of such cells 
in the crystal. /'"(r) expresses the internal state of an electron-hole pair and satisfies 
a Schroedinger-like-eqation in the effective mass approximation. The first factor on the 
right-hand side of (29) represents the wave function of an exciton’s over-all motion in 
the crystal. Superimposed upon this is a modulation produced by the atom cores of the 
crystal, whose effect is included in the Bloch orbitals involved in the second factor on 
the right-hand side of (29). The corresponding energies become, for a simple case in 
the effective mass approximation, 


Bus dpe. a Ge pees 


—* é 31 
2(m,+m,) — 2b%en® (31) 


where JE, expresses the energy gap between a conduction and a valence band, m, and 
m, the effective masses of an electron in conduction band and of a hole in valence band, 
respectively. /s denotes the reduced mass, € the dielectric constant and n= 1, 2, +++++ ; 
The second term in (31) represents the translational energy of a center of mass of an 
exciton and the third the internal energy of an exciton. On the other hand, our wave 
function of an exciton in the present formalism may be written as 
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+o — Ds Aes RAG ed, (32) 
n U 


In accordance with the rather strong interaction of an excited electron with a hole, the 
internal motion of an exciton is not separated out from the influences of the atom cores 
in the crystal but is involved in the determinantal function *Y', although the trans- 
lational part of an exciton is given by the factor, exp(iK-R,), in accordance with the 
situation that the translational motion of an exciton has been taken into account in the 
higher order approximation. The corresponding energy for a special case of a single 
valence band is given by (16). Furthermore, each term of (16) has fairly simple inter- 
pretation on the atomic view-point, as described before. Thus, we may conclude that 
the present method of procedure is reasonably valid for the case of a rather strong interac- 
tion of an excited electron with a positive hole and provides some refinement of the 
original work of Frenkel. 

c) Spin-orbit coupling. 

Now, we shall discuss qualitatively the effect of spin-orbit coupling on the structure 
of exciton multiplet. As a consequence of the approximations we have made, the spin- 
orbit coupling energy” of N valence electrons in the ion core field of the crystal may 


be written as 


N 2h 1 
Hea u > Dil gtadV;, (15) x Py)-oj;+ 2S) [ry X Py) 5, (33) 
4m? Cc j=1 2m c” t#5 Yi, 


where P=6/i- grad, r,;=r;—rj and the other quantities have the usual meaning. 


Then, the Schroedinger wave equation becomes 
(H+ Hs;) -P=HE.-, (34) 


in which H is given by (2). Our wave function is approximately represented by a 
linear combination of the triplet and singlet wave functions which satisfy 2). 


3 
v= >) Pag Pa, Py t ay Py tag Fis}, (35) 


aa 
where 


hay, ere 1 VY jh + Ry Ly (nt,et) 
L, =~ ee Ang aay y. 


n 


n Z Nias (36) 


3 2 Ses Y th +R, 3y7 (nt, et) 
D4 ee Asn Das Py 


n 


3 \ jh +R, 3 nl el) 
SU ep pay Aste, ch , f pe \ 
Z 


n 


[he substitution of (35) in (34) leads, in a straightforward way, to a 12X12 
secular equation, which is so complicated to be soluble at present. Then the exciton 
multiplet is seen to consist of twelve components at most although their energy separa- 
tions can hardly be predicted without the actual solution of the secular equation. The 
relative intensities of the exciton absorption components can be computed easily once the 


= ° uf 3 3 3 
secular equation is solved and the corresponding coefficients “4;, “ay;, “4: and *a_,; are 
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actually determined. In this case, since the singlet wave function is superimposed on the 
triplet one, the dipole transition to the mostly triplet exciton state from the ground state 
will be realized, resulting into the further splitting of the exciton peak. However, it 
should be mentioned that all the exciton multiplet components are not always observed 
experimentally on account of the limited resolution of optical instrument and the tempera- 
ture-broadening. In order to see in more detail the gross behaviour of the effect of small 
spin-orbit coupling in alkali chlorides and fluorides, we shall actually compute the diagonal 
elements of the energy matrix involved in the secular equation or the first order energy- 
shift of the singlet and triplet exciton states corresponding to K=O (exciton peak) by 
the conventional perturbation method. The computations are straightforward and then the 
singlet exciton peak remains unchanged in energy, to this order of approximation. On 
the other hand, the triplet states (K=O) corresponding to My= +1 shift upward and 
downward by an equal amount, respectively, with respect to a triplet state M,=0, which 
is unchanged to the same approximation, M, being magnetic quantum number of the 
total spin. The corresponding coefficients of mixing singlet and triplet wave functions 
become proportional to the matrix elements of H,; with respect to singlet and triplet 
exciton states. 

In their reflection method of measurement, Hartman, Nelson and Siegfried'® have 
actually observed the apparent splitting into doublet of the exciton absorption peak, at the 
temperatures below 150°K, in the cleaved crystals of KCl and NaCl. Their observed 
splitting at 6°K becomes of 0.14 eV in KCl, i. e., about a doublet separation of a 
chlorine atom (0.11 eV). The result would probably be interpreted as follows. In ac- 
cordance with the relatively small spin-orbit coupling energy of the valence electrons in 
alkali chlorides, the singlet exciton peak will practically be unchanged even if we allow 
for the spin-orbit coupling and, futhermore, the splitting of the triplet exciton peak will 
perhaps be too small to be resolved experimentally in addition to the temperature broaden- 
ing. The only sizable effect of the spin orbit coupling is to superimpose, in some mea- 
sure, the singlet exciton wave function onto the triplet wave functions, which will 
presumably enable one of having non-vanishing but weak transition to the mostly triplet 
exciton state from the ground state (singlet state). If the above considerations were 
really valid, the shorter wave length peak of high intensity would be ascribed to the 
transition to a mostly singlet exciton state, while, the longer wave length peak of low 
intensity would be ascribed to the transition to a mostly triplet exciton state. According 
to the equation (26), therefore, the energy separation between two peaks will be ap- 
proximately given by twice the exchange energy of an exciton-electron with a positive hole 
with the same location. Then, it follows 


(el, nlle*/r,|nl, el) = 0.07 eV, 


which, in turn, leads to by,~8.3 eV for a singlet exciton peak at T=—0°K. 


Finally, the authors would like to express their gratitude for financial support from 
Grant-in-Aid of the Education Ministry of Japan. 
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The physical state of unstable particle previously proposed is derived as the state produced in a 
finite time interval on the basis of the scattering theory of Gell-Mann and Goldberger, and also the 
exact state is obtained from the physical state by employing an appropriate limiting process. An 
S-matrix-theoretical calculation method is proposed for the processes including unstable particles. 


§ 1. Introduction 


In the previous paper,” which is quoted as I, we investigated the clothed state of 
unstable particles. By introducing a new mathematical notion, “‘ complex distribution ”’, 
we could construct the exact eigenstate of total Hamiltonian corresponding to the unstable 
particle, which we called “exact state of the unstable particle”. But it being not obser- 
vable, we defined the physical state of the unstable particle by its approximate state. 
Hence the definition of the physical state might seem rather artificial, though it was 
shown that this state satisfies various physical requirements. We did not clarify the 
physical meaning of the rather indefinite function f(w) which appeared in the physical 
state. 

Recently Naito” has investigated the finite-time scattering theory by the wave-packet 
method, and succeeded in treating the production and decay of the unstable particle. So, 
if we treat only the production in a finite time interval, it is expected that we can obtain 
the physical state of the unstable particle. In order to formulate this mathematically, 
we make use of Gell-Mann and Goldberger’s scattering formalism.* We define the physical 
state by the state produced in a finite time interval 1/x, employing the more general 


form 7,(T) instead of their switching function lim &e*”. We can then prove that it 
e£>+0 


coincides with the physical state defined in I apart from the normalization constant, and 
that f(w), which is now represented as a functional of %x(T), satisfies the four condi- 
tions postulated in I. 

Further, we demonstrate that for «0 the above physical state tends to the exect state 
defined in I when 7,(T) is chosen appropriately. The former thus is a good approxim- 
ation of the latter for appropriate 7,(T) with small x. 

It may be natural to use the physical state in the calculation of matrix elements re- 
lating to unstable particles. But it contains a rather indefinite function f(w) which cannot 


— 
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generally be controlled by experiments, and so the calculations become futilely complicated. 
We therefore propose to use the exact state instead of the physical one in the calculations 
including unstable particles. According to this method, we can treat transition matrices of 
the decay or scattering of unstable particles in the same way as in the usual S-matrix theory 
of stable particles. This method thus permits to treat unstable particles by the S-matrix- 
theoretical formalism. 


§ 2. Physical state 


In this section we employ Lee’s model’ for the sake of simplicity. Notations are same 
as in I throughout this paper, unless otherwise indicated. 

According to the scattering formalism of Gell-Mann and Goldberger, the outgoing- 
wave eigenstate is obtained by the following adiabatic process : 


1. ee Pae 
1( my—W,) a,*|N). (2-1) 


0 
|NO(p)*)=lim| dT &e ¢ 
é>+9J —o 
This represents a superposition of the incident waves emitted in various times at the pre- 


sence of interaction. Quite analogously to this, let us consider the following state : 


9 1(H— Die 
| Leis Hm ey (2-2) 


which represents the superposition of the V- 
nx(T) particle amplitudes produced with intensity 
7,(T). Actually, the physical V-particle is 
produced in a finite time interval, and so the 
state Y" defined above may be regarded as 
the physical unstable state. Here we assume 
that the function 7,(T) roughly takes such 
a form as Fig. 1. As « must be finite, 
Bip generally depends on the form of the func- 


tion 7,(T). The usual choice 
Fig. 1: om (T) =xe"” (2-3) 
is not preferable, because it has the following defects. 


i) It has a finite tangent at the final time T=0.* 
ii) It does not damp sufficiently rapidly for T—— oo. 


The meaning of these respects will be clarified later. E,, which represents the rest energy 
of the V-particle, might be m, for the bare state, but we should use here the renormalized 


mass as E,, as is well known. Namely, we put 


E,=my—iy/2 (2-4) 


* This physically means that the production does not become stationary at the final time. 
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which is the pole of the modified propagator S,/(E). Hence 7, (T) must damp faster 
than at least e™”” in order to make the integral (2-2) converge. In order to calculate 
(2-2), we insert into it the complete set”? 


| Nd(p) ~ =9(w,) op (my+w,) |\V) 


all 0(k—p) —9(a,) Sy’ (m+ emp) Le) ___ lagi) 
Wp— Wp —1E 
(2-5) 
with 


We then obtain 


0 i(mx- ,-E i 2 
we—\y,(T)aT| | dp p(a,)e ( N +o, 0) IV) 
1 my -—E Ss 
+ |ax J (wx) 5,/* (my+w,)e veils ; a;,*dk|N) 


i(my+o,—E,) Tl d(w 
(my +,—E) | Thou) ers dkINY |, (2-6) 


OW, Wp t 


=a ap: (wy) e 


where 
e (Wp) =ag* (w,) S,/* (my+ Wy) Sy’ (my+ W,) 5 (2-7) 
By using Lehmann’s spectral representation” 
Sy'* (my + wx) m= dp i (w,) ; (2 q 8) 
Ox— Wy — tE 
(2.6) can be rewritten as 
y* eel + ap fp (w,) | F (awa, Wp) 9 (wx) a,,*dk| N) - (2 . 9) 
where 
c =\¢p (ay) | dT 7,(T) Pa i ak (2-10) 
and 


0 atime eas Be TS, i(my +o, —Eo)T 
en ay) =] 9 (T)AT hat Vardlewta S cs. es (2-11) 


On Wy 


F(x, wy) is regular at w,=«@, and so —i€ in the denominator is unnecessary. F (w,,w,) 
is rewritten as follows. 


0 
F (cy 0») =—i| 9 (T) dT 


ita top BT i i(@z—@y) © 


€ 
r 


i(my+@,—Ep) (T—1) 


i" i(my+ox—Epo)t 
—ile F iy od 


—o 


dz | (Te 
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22) 


—-o 


PbGe, if ¢7 (my +a,— Ep) "ae Eh t+orz—Ep)t 


—-o 


Substituting (2.12) in (2.9) and integrating by parts relating to t, we obtain 


0 1 < aS 
P* =|) — \ap p(y) ar cop Sasi A Re 8) 
=e 4 my+a,—E, 


o i(m -—E 
| m2) — (de 2 (my +ox pl T+2) fat |N) 


—-—o 


= ¢ 17) |—S4oo —(1— F(an)} extdh| >| (2-13) 
where . 
flor) i GO) dt (2-14) 
with 


i(my at Opn Eo) 1h 


heme ena fr wer ay 


Here the function 4(t) has the following properties : 


+o 0 +o t/2 SG a 
i) [A deme* lap poy) (AT | oe(T)+{G/2)€ 0 ga T defer? 
J—-o 0 


i(my+o,—E)T 
1(myn Vp 0) a0) Catz) 


ro 
ii) h(0) =<"'ldp P(w,) jar qx (T )e 
Because 7,/(T) is a sufficiently smooth function and has the values appreciably different 
from zero only in the neighbourhood of T=—1/x, and so the integral approximately 
vanishes due to the rapidly oscillating factor. For h’(0), h’’(0), etc., the same is true, 
provided that 7,(T) has the higher derivatives. 
Now the final expression of /”, (2-13), has completely the same form with the 
physical state |V% given in I, apart from the normalization factor. And, further, we 
can prove that f(«w,) actually satisfies the four conditions postulated in I. 


i) f(a,)~1 for |jax+my—my| <k. 
Because we then get f(«,) ~| h(t)dt~1 from (2-16) for 7<k. 


ii) f(a) ~0 for |w,t+my—my|>« where «, is real. 
Because, f(w,) then vanishes approximately owing to (2-17) for the contribution 
from t-0 and to the rapidly oscillating factor for that from the other part. 
iii) f(w,) is regular near the real axis. 
The analyticity in the lower half-plane is obvious from (2.14). Further, the 
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Fourier transform h(t) nearly vanishes in the neighbourhood of t=0, and thus 
flw,) is regular near the real axis. : 

iv) f(w,) vanishes far away in the lower half-plane. 
This is obvious from (2-14). 


Thus the physical meaning of the parameter « in |V1) is the inverse of the production 

time, as was expected. 
Finally, it is of interest to consider the case of the simple but unpreferable choice 

n.(T) =xe*?. In this case, the integration (2.11) is easily carried out, and the result 


1S 


r= |V)—| I(r) a,,.*dk|N) (2-18) 


gk 0) E, = 1K 
which is nothing but the state proposed by Glaser and Kallen.”* Then f(w,) becomes 
flo) = —ix/(my+a,—E,—ik), (2-19) 


which decreases only linearly at a great distance and has a pole w,=E,—my+ix near 
the real axis because of 7,’ (0) = finite. 


§ 3. General case 


In this section we treat the general case. As in I, we write H=H,+H,, and denote 
the eigenstates of H, by |V) and |n). Then the eigenstate of H, |n*), is given by® 


|n*> = —i€ (H—E,,—i€)—"|n). (3-1) 


It should be noticed that the state (3.1) is not normalized when incident particles have 
their clouds.” We, nevertheless, treat it as if it were normalized, since this matter is 
not so important. The correct renormalization will be discussed in Appendix. Rewriting 
(H—E,,—i€)~" in terms of U-matrix according to (4.5) in Araki et al., we obtain 


|n*) =(V|U(E,) |n)Sy! (Eq) |V) +]n) — cn PLEA nd I) 


rc 
m—E,—ié 


—(VIUE,) n)Sy/ (Ey) (dn VE) Diy (3-2) 


E,,—E,,—ié 
Here (V|U(E,) |n) corresponds to y(w,) in Lee’s model, in which the third term of the 
right-hand side vanishes owing to the special type of interaction. Hence the derivation 
in last section does not hold as it is. We therefore newly put H= H,+H,, where H, 


is the decay interaction part of V-particle and H, is the other part. Using H, instead 
of H,, we have a similar expression as (3.2), but then the third term is removed. We 


* In their paper, the sign of x (=A) is indefinite, but here it is limited to ck >0. 


A Theory of Clothed Unstable Particles. IT 827 


denote the corresponding quantities by affixing the symbol ~. The calculations in 
last section can be straightforwardly repeated. But it must be noticed that the numerator 


of the integrand of the fourth term, (m|U (ER) \7 ), depends not only on m but also on 
E,. Namely, this E,-dependence becomes a drag on factorizing c in (2.13). Hence, 
if we assume that (m|U (E,,) \V ) does not strongly depend on E,, and so the main con- 


tribution comes out from the neighbourhood of E,,=E,, then we can approximately replace 


it by (m|U(E,)|V) and finally obtain 


wnzd (P)— [dn MEH) (3 ¢(B,)} 4) |. (3-3) 
: E,,—E, 
Now, H, is not necessarily equal to H,+H,. For example, for the weak interaction 
of ANz the part A@2N+7 belongs to H, but the part 1+72N belongs to Ay olf 
such an unnatural division of H is not desirable, we should put H,=H,+H, and neglect 
the contribution of the third term caused by H,,—H,, which is indeed very small. 
These approximations employed above become better for smaller «, and also depend 
on the form of the function 7,(T). For instance, if 7,(T) =«e*", the approximations 
are not so good even for small «, and it is not allowed to make « smaller than 7/2. 


But if we adopt a very rapidly damping function like 
(ADS Ceifne es (3-4) 


then the approximations speedily become good. Further, in this case, we can define the 
limit «0, and &% tends to the exact state defined in I, as is expected. We prove this 


in the following. 
As a preliminary, we calculate the limit of an integral, 


[= lim n(T)e® dT, (3-5) 


A>t+0J — 


where E is an arbitrary complex number, and 7,(T) is given by (3.4). Carrying out 
the integration, we easily obtain the following result. 

ip al —Oatordim Eb <0. 

ii) I=0 for Im E=0 and |Re E|>|Im E]. 

iii) I=1 for E=0. 

iv) otherwise [=indefinite (or infinite ) . 
Next, making use of this, we evaluate a functional 


+O. 1(B— Bo) I 
e 


Ig |=lim ( (T)dT| | o(E)dE. (356) 


Here E,=m,—iy/2 and my—M > 7/2+0, 0>0 (infinitesimal). M physically represents 
= 
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the threshold energy (M=my+yp in Lee’s 
> model). @(E) is a meromorphic function in the 
YY) lower half-plane and regular in the domain D 
except for E=E, at which it may have a pole 
of the first order. The domain D is defined by 


M 


|Re E—m,| <j /24+9, 
d>Im E>—(7/2+8). (3-7) 


G 


Fig. 2. Denoting the distant, large semi-circle in the 


lower half-plane by C, we can write 
_ -2\ i(E—E,)T _—. i(Ey-E)T 
ioetefemar|((z[7 "Momma! s] 
(3-8) 


where R, stands for the residue of g(E) at E;. The contributions from other than 
E=E, all vanish on account of i) and ii) presented above. We thus obtain 


I\g|=—2zi lim (E—E,) ¢(E). (3-9) 
E>E» 


Now, we calculate the limiting state, which stands for the idealized physical state 
produced in the infinite time interval, 


P=lim y* 
0 i (E,,— Ey) T 
=lim| 7, (T)dT (dn Sg. . (n|U* (E,) |V)S,’* (E,) |n*) , 


(3-10) 


where |n*) is given by (3.2). We analytically continue the integrand to the lower 
half-plane relating to E,. We then know that S,’(E,) has a pole of the first order at 
E,=E,, and assume that S,,’(E,,) and U-matrix elements have no other singularities in 
the domain D, Then (3.9) assures that the second and third terms of (3.10) where 
|n* is expressed by (3.2) vanish, and we obtain 


| vs 
v=c| IV) — fans si im | (3-11) 


using the notation of complex distribution.* Here 
c= 281Z,-Sy/* (E,) |dn 0(E,—E,) (V|U(E,) |n)(n|U*(E,) 7) (3-12) 
with 


V dae (0/0E) [Sy (E) jie ne k= [lim (E—E,) Sy (E) : he (3 +1 3) 


* Here the field operators are, of course, generalized to the complex distributional sense as in I. But, 
rather, we should understand this procedure as the analytic continuation (relating to E,,) of [(E,,) (m|n*)dE,,, 
where ¢(E,,) is an arbitrary regular function. 


aah 
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[5,/* (E,)}?=2 (E,) —2* (E,) = V|[U(E,) —U*(E,) |W) (3-14) 
and 
IME USE) Ht — Ati 2) Hi, || (iy EB) — (Hy — EB) 
-(1—A) HI 14+ (—A) (A,?—E) 7H, ]* 
= — 2miU* (E)0(H,—E) (1—A)U(E), (3-15) 
we find 
Cha (3-16) 
(3.11) proves that VY exactly coincides with the exact state |V) defined in I apart 
from the coefficient Z, Though this result is the expected one, it is important that the 
relation between the time integral and the complex distribution of energy has been clarified. 


Namely, roughly speaking, the following conditions are necessary to yield the expected 
complex distribution. 


i) The switching function 7,(T) must damp very rapidly at the remote past, that 
is to say, the particle must be created more quickly than its decay. 

ii) The threshold energy must be sufficiently distant from the level of the unstable 
particle in comparison with the level width (my—M S72). 

iii) There must be no other unstable levels near my—iy/2 on the complex plane. 


When these conditions are satisfied, the approximations employed in the derivation 
of the physical state are well justified. Indeed, for 7,(T) presented by (3.4), for exa- 
mple, the integration on T yields a factor exp| — {(E,—my)*— (7/2)*} /4«°] approxi- 
mately, and so the contribution from |E,—my,|>« becomes very small. Thus it suffices. 
to consider only the contribution from the neighbourhood of E,=£, in the integration 


on £,. 


$4, S-matrix theory of unstable particles 


In the conventional theory the decay amplitude of the V-particle is calculated by 


(n- V >), (4-1) 
where |n~) is the incoming-wave eigenstate of H, while the S-matrix is defined by 
5, (nh nt* ), (4-2) 


The asymmetry between the above treatments is obvious. The former is the transition 
amplitude from t=0 to t=-+ oo, while the latter is that from t=— 00 to t=+. If 
we force ourselves to use the S-matrix for the decay in the framework of the conventional 


theory, we have 


S, p= —271-0(E,—my) (n-|H,|V), (4-3) 
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which exactly vanishes,” though the perturbation calculation yields divergent results (so- 
called “ infrared-catastrophe-like divergence”) except for the lowest order. 
Now, we have proposed in I that the decay amplitude should be calculated by 


(n-|VYY. (4-4) 


But the calculations using the physical state are generally very cumbersome. And since 
‘we cannot accurately control the production rate 7,(T) experimentally, detailed calcula- 
tions are meaningless. Hence we propose to calculate the decay amplitude by 


N(n-|V), (4-5) 


‘where N is a normalization constant which is determined later, instead of (4-4). The 
approximation used here is very good as stated in § 3, though, of course, such a replace- 
ment as I (4-1) should be made for complex distribution in the evaluation of probability 
(cf. I (3-11) and I (4-9)). As was proved in § 3, (4-5) can be interpreted as the 
transition amplitude from t=—co to t=+0o0. It was shown in I that |W) is ap- 


parently orthogonal to |m*). When we introduce the “conjugate state of |W)”: 


rey eee {n|U* (E,*)|V))_. : 
IP) =IV) \a eee ad (4-6) 
with HIP) =E4|V), 
which is apparently orthogonal to |m~), we have the “‘ extended S-matrix ”’: 
{n-|m*» Nin“ |V) 
(4-7) 
\ NCV*|m*> N*<V*|V) 
We can easily prove 
es La? Te! Sor (4-8) 


where Z, is defined by (3-13), by means of the same technics as 1(5-12)~(5-13). If 
we require that the last element of (4.7) is equal to unity, we obtain 


N=Z," (4-9) 


as the normalization constant of |W), accordingly, that of LO 50 ts INF ZO, 


The (n-V)-element of (4.7), Z,'"{n-|V), corresponds to the decay amplitude. We 
can now rewrite this in the R-matrix form” : 


Z, in| Vv) =—271- 0(E,—E,) rae eb | Vv» 
= —271- 0(E,— E,) Z.?¢n|A [1 poe qd —v, A) Oo Ne aa E,) “U(E,) \|l7) 
= —27i-0(E,—E,)Z,'(n|U(E,) |V). (4-10) 
We thus see that the decay probability to |n) is given by 


reso 54) ie ale 
(E,,— my)? + (7/2)? |Z, ¢ lU( L 7/2) |V>|?. (4-11) 
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The total decay probability should be approximately normalized to unity, 
Nalieeoe 0 because this approximately represents the decay probability of the physical 
state. Hence we have 


V 
7/228 | dnd (E,—my) |Z] (om) PE (4212) 
t : 
Ned 
N \o for very small 7. And we find that the transition probability per unit 
. time for 7-0 is 
Fig. 3. 


270 (E,—my) |Z,'?(n|U (mp) |V)|?, (4-13) 


which is a well-known formula’? apart from the Z-factor. 

Finally, we briefly discuss the scattering problems including unstable particles. The 
perturbational calculation of these processes usually lead to the so-called infrared-catastrophe- 
like divergence in the conventional theory. For example, consider the N-# resonance 
scattering in Lee’s model. Though the exact calculation of (Nd (p’)~|N0(p)*) does 


not give rise to any divergence, the free propagator of the V’-particle is 
S,(E) =1/(E—my +i€) (4-14) 
and the integral of its absolute square is divergent when the V-particle is unstable. Hence 
we should use 
Sp (Ey 1/ (EE) (4-15) 


as the free propagator instead of (4.14). 

The scattering of unstable particles can hardly be treated by the conventional theory. 
Then we apply to this problem the method of the extended S-matrix proposed above. 
For example, the S-matrix for the V-( scattering in Lee’s model is defined by 


(v80(p')- FOP) (416), 
aN V 
where : 
H|V0(p)*>=(E,+,) |V@(p)*) i \ 
and \ IN \ 
H|V*0(p)-)= (E,*+0,) |V*0(p)~)- i 
In the perturbational calculation the propagator of the N-particle in V ‘ 
Fig. 4 is Fig. 4. 
Sy (my— Op) = li (my—w,p—my+i€), (4: iz) 
which leads to a divergence, while in the present method it becomes 
Sy (E,— wy) = ie (E,—o,— my), (4: 18) 


and finally is replaced by 
{1—f(w,)} / (E>—@p—™y) - (4- 19) 
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Thus we can treat the scattering problems of unstable particles. 
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Appendix 


The state (3.1) is not normalized, 1.e. 
{n*|n*)=Zi, 
where Z, is the product of the Z-factors of incident particles. Accordingly, the com- 


pleteness should be revised as 
[atu] Zn" foi. 


Hence the integrand of (3.12) contains an additional factor Z,~". The calculation (3.15) 
which is usually accepted*) is no longer valid. To understand this, consider, for example, 
its fourth order terms of perturbation : 


H[(1—A) (A, —E) 1H, P— H[ — A) (A, —£) “HF 
=H, (1—A)| (A)? — E)*— (A) — E) “JH, (1— A) (A — E) “AF 
+ H,(1—A) (H,\ — E) 71H, (1— A) [ (H. — E) = (HP —E) *")- 
H,(1— A) (H,“ —E) HA, 
+ H,[ (1—A) (H, —E)-'H, F(a — A) [ (HW) — E) 7— (A, —E) 7], 


If (1—A) H,(1—A) (H,| —E)~'H,(1—A) has diagonal elements (i.e. self-energies), 
then the products (H,°—E)~'---(H,“—E)~ appearing in the intermediary terms of 
the right-hand side are not well defined. Thus (3.15) is inadequate. 

We will investigate this problem by means of the following concrete example 
(“doubled Lee model’’) which is exactly soluble. The Hamiltonian is given by 


A=myhy* y+ m A Yy* Jy+tmy diy* a+ \ oO *0,.dk 4. Joos O;.* On dk 


+ \7 (cox) A rp ry Oy + yy*O,.* | + | fox) dk by* hardy + Sra Ok |. 


Not only the V-particle but also the N-particle have their clouds. Their modified pro- 
pagators respectively are 


[S)/ (E) 1 =E —my'+i&— |g" (an) Sy! (E—«,) dk, 
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(‘Su EB) PSE mis 5 7 (E), 


where 


seins (re opel wh) 
we) (lapse oar: 


The elements of the U-matrix are as follows. 
(VIU(E)|V) =| 9" (wx) Sy! (E— 04) dk, 


V|U(E) 4,*|N)=9 (wp) + (E—my—a,+i€) Sy’ (E—o,) , 
(N|Op, U(E) Ay*|N) = 
— (V'y(E—a,) — my) (E~my— wy +18) Sy! (E— ey) (P= Pp’), 
(M0640 (E)0,,*|N) = flo) (E—my— 0, +18) Sx! (E— 0) (pp), 
(V|U(E) 6, *bx*|M) =9 (@,) flex) Sx! (E— 0p). 
Hence we obtain 


(1—A) (my + w,—Hy+i&) “U (my + 0,) Fp*|N)Y 


=Zx(1—Z5!)0,*|N) + Zr} ED, < bn*butdk|M), 


v—My—o,+t 


(1—A) (my +0,—H,+i€)71U (my + w,) AS)! (my + 0,) U(my+a,)9,*|N » 


=Zy9 (cy) So (my+o,) || 9 (ug) Sw! (tt y-+ toy — cog) O*dke|N’) 


+ [foe flaw) Sx Orr t ee ee) 9, 84hb, dhe) | 


My + Wyp— My — On Oy +1€ 


where Zy=lim (E—my) Sy' (E), 


L>m x 
and thus from (3.2) 
w|NO(p)*>=9 (op) Sy! (my + op) |V 


ae {ia (k—p) +9 (o>) Sy! (my+ Wp) J (wg) Sw! (My + Op— wp) |O.*dk| N 


+([Ree [O(k—p) +9 (ep) Sy’ (my + Oy) J (wr) Sy! (my +a,— Ox) | 


my + Wp— Ny —fwas= Outs 
X O;* de by.*dke'|M 
and 
(NO (p')*|NO(p)*)=Zv- 3 (p—Pp’)- 
Now, we compare the both sides of (3915). 
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where 
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(V\[U(E) — U*(E)]|V)= 2i\ oon) ImSy/ (E—w,) dk 


pe Pe (72 (w,) (Ly 0 (E—my— ay) +0 (E—w,) \dk, 


d 


—27i(V|U* (E)6(H,—E) (1— A) U(E) |V) 


e 


= — 2zi| F (wz) [ZO (E—my— ow) +7 (E—w,) \dk, 


o (E) ==|Sy! (E) P| £2 (cop) 8(E—my—ey) dk. 


‘Thus the first terms of the both do not coincide. The two corrections in (3.12) and 
in (3.15) are cancelled each other, and hence the final result (3.16) is unchanged, as 
it should be. 


10) 


11) 


N. 
K. 


M. 
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Using the dispersion relations, the sum of the total cross sections for the xz*-p and zx -p 
interactions are evaluated at very high energies, under the assumption that the cross sections are 
constants above a certain very high energy. The numerical value of the sum of the cross sections is 
determined mainly by the empirical data at low energies (less than 2Bev). The analysis gives the 
sum of the cross sections a few times as large as twice of the geometrical cross section obtained by 
assuming that the “radius” of the nucleon is the pion Compton wavelength. Since the dispersion 
relations seem to be consistent with the experiments at low energies, this strongly suggests that the 
dispersion relation would break down at very high energies. 


§ 1. Introduction 


It is very interesting to investigate the consistency between the dispersion relations 
for the pion-nucleon scattering amplitudes and the experiments, because the inconsistency 
between them means the break down of at least one of the fundamental assumptions 
which are necessary to write down the dispersion relations, for example, the micro-causality, 
the charge-symmetry, the conservation of the probability and so on. At low energies 
this problem has been studied by many authors.) The experiments seem to favour the 
dispersion relation, though the conclusions are not yet decisive. The reason is that the 
dispersion relation involves a principal integral and then the small experimental errors yield 
large ambiguities” as long as we restrict our attention only in low energy regions. Thus 
it becomes very interesting to analyse the dispersion relation at high energies. Further- 
more, it is very likely that the discrepancy would appear at high energies if the dispersion 
relation were inconsistent with the experiments. 

In this paper we investigate the total cross section at very high energies (> 30 Bev), 
using the new dispersion relation given by W. Gilbert,” under the following assumptions ; 

I) The sum of the total cross sections becomes a constant at a certain high energy, 
say Ly. 

II) It changes linearly as the energy varies from 2 Bev to Z,,. The second as- 
sumption is introduced just for facilitating the calculation, so not an essential one. The 
integral to be calculated is fairly convergent and the contributions to the integral from 
this energy range are very small. The first assumption is plausible for two reasons ; first, 
the energy dependence of the total cross section is theoretically limited by the relation.” 


o(w) ecw", —1Xn<l, 
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at high energies. Secondly, empirically it seems to be a constant (~30 mb) above the 
energy about 2 Bev.” 

Following Gilbert, we can express the sum of the total cross sections for the 7° —p 
and 7~—p reactions at a certain high energy, say Zx, as the sum of the coupling constant 
term, the cross sections at the threshold and the convergent integral involving the real 
parts of the scattering amplitudes.” Under the assumptions the integral can be evaluated 
if we use the ordinary dispersion relation to obtain the real parts in the energy region 
where the empirical phase shifts are not available (above 300 Mev). This partial use of 
the ordinary dispersion relation is the essential point of the present paper. Here it should be 
noted that we must not make Z, infinite before performing the integration, otherwise we 
would obtain only the vanishing cross sections at the infinite energy. The reason is that 
we have only the identity of the cross sections at the energy Zx, if we substitute the 
integral of the imaginary parts for the real parts in all the energy regions. Then, to 
make Z x infinite before the integration means to discard the one side of the identity.* 

The sum of the total cross sections thus obtained is fairly insensitive to the choice 
of Zx but depends on the energy Z,,, where the cross sections become constant. Never- 
theless, it becomes a few times as large as the corresponding one obtained by assuming 
that the “ radius’ 
This strongly suggests that the dispersion relation would fail to explain the empirical facts 


’ of the nucleon were determined by the meson Compton wave-length. 


at very high energies. The ambiguity from the experimental data does not seem to upset 
the above conclusion. The calculational method is outlined in Section 2. In the final 
section the numerical results and the concluding discussion are given. 


§2. Derivation of the sum of the total cross sections 


We start from the new dispersion relations given by W. Gilbert” 


(ott o" Jee (o* to"), EE) | AEE A 2 pl sk RP | 
Kk ome JSF F—1(e2—z")!’” 


with 
b=1/2K, BG=47f*, 
where z is the laboratory meson energy,** f is the unrationalized pseudovector coupling 


constant, and « is the rest mass of the nucleon, o* and o> are the total cross sections 


of the reaction ™*—p and z-— p, respectively. The scattering amplitude D*, even in the 
meson isotopic indices, is normalized as follows, 


ImD*=7E(o*++o-), (2) 


* Vavilov’s calculation of the cross section is entirel i i i 
y nonsense in this res: P. V..Vi Joely. Le Be 
(USSR) 32 (1957), 144). vs ere 


Throughout in this paper we use the unit }=c=p=1, where u is the meson rest mass. 


. 


— 
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‘where 7 is the center-of-mass meson momentum and E is the total center-of-mass energy. 
A slight modification to the integrand of the principal integral in (1) enables us 
to make z’ unity. Changing the variable z into €=(z?—1)', we obtain 


(o++o—-)¢=(ot+o-),—— 25°C" : 
(1+1/é 
oe ely fe $ 
ap | ee {Re D*(¢) —ReD*(0)}. (3) 
TK o2(g2— 


We use the same notation D‘(¢) to represent a new function D*(z=// +1) of the 
variable ¢. The integral is no more singular at ¢=0. It is our purpose to check the 
consistency with the empirical data by calculating the right hand side of (3) at the very 
high energy ==K. 

The main problem in this section is how to express the integral in (3) with the 
experimental data. Since the empirical phase shifts are available only at low energies 
(<300 Mev), we separate the integral region into two sections. 

Region A: O<s<¢,. If we choose €, being about 3, we can directly evaluate the 
integral with the experimental phase shifts. The integral in this region is 


a 2 45 pepe(£) = Re D* 
L=+ | eaneyey RPO -RDO} (4) 


— 


aK 


Region B: E,<¢< co. In this region we can obtain the real part of the amplitude 
by means of the usual dispersion relation” 


where we abbreviate the sum of the cross sections as 
f(S) =ImD* (F) /Ko = (o* +07) . (6) 
If we substitute (5) for the integrand of the relevant integral, the contribution in this 


range becomes 


2 i : tng Sabas, (a deb ache 
ne = 4b°G? al : aes is ye s kK2p a? - arte’). (7) 
ee per) ey ae Re OMS 82) 
ay, “L 


The first integral of (7) is easily calculated and gives 


i" 4B°G? ps d= ts 4b2G? 1 | ean! ¢,—--2 2 |e) 
Bet ie (@—K*) (#41) me 1+1/K* gla 


08 


a ==In(K—€,)/(K+€;). 
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The second integral, 


dé P| dé’ 


t= — RP | ae 


rT 


needs a special consideration because of the double principal integration. 


(9) 


We separate the 


region of the double integration into several sections, which are shown in Fig. 1. 


a) €; (€;~K—é’) and (K+e&’~oo), &; (O~&;,—8&) 

b) &; (€/+e~K-—é’) and (K+&~00), &3 (€,—€~Ex +8) 

c) &3 (€,~&/—8), (6 +e~K—&) and (K+&~oco) 
&’; (€, +e~Fn—&) 

d) &3 (€p~€/—s8), (&/+e8~K-—&) and (K+e&’~o0) 
&; (€xa—e~Fu +6) 

e) &€ 3 (€y~&—s), (&+e~K-—e&’) and (K+&~oo) 
&; (En te~K—s—-<&’) 

f) & 3; (€y~é’—s) and (K+é&’~co) 
&; (K-&-—&~K-&+&) 

g) &€ 3 (€u~&’—s) and (€’+E~o0o) 
&’; (K—e+e’~K+8-&’) 

h) &€ 3 (€y~K-—é&’) and (&/+&~oo) 
&; (Kt+e—e’~K+e+2) 

i) € 3; (€y~K-—e’), (K+e’~&’—8) and (é’+e~oo) 
&; (K+e+e/~oo) 

j) €3 (€r~€n), &3 (Ex +e~co) 


Fig. 1. ness integral regions of the double integral (9). The small gaps 
€ ale & are made to vanish after the integration is carried out. We make 
€>€’ here, but the result is indeperdent on how to make the limitation. 


=. 
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The principal integration means to make € and €’ vanish after the integration is performed. 


Throughout the calculation we always put 
E> eé! (10) 


for convenience sake, since we can obtain the unique result independent of how to make 
the limitation. 
After careful calculations, we obtain as the contributions of the integral (9) in each 


sections 


i me 


| me ty 5h es - 
ye 18) 2 =f de"f(") { ; de'f(¢ Ad sin Tie 
Wee ) 1—¢$”?/K? ) é (1 — €2/K2) S : 


ile ute [ea -6( Spar (iia} 


1—¢,?/K = 
l,=0, (11b) 
dz! (¢’) ie ds’ f ( =) Bl ee 
2, { @ SES i e fs ge4e, 
Te od - = = ina = 
“it nm K | 1—¢”/K? sz | s’(1—¢?/K*)  s’—<¢, 
Bg Sypt6 
2f(Fr) le 1 =a ae i (11c) 
oe ae 1—¢,2/K? a ¢,+0 ( 
Izg=0, (11d): 


ME 8189" [ @in7* ane +26(1) ~46(=# B) 


= 
a. p( ee | (11e) 


-F(AS 


F520, CLLE) 
Tyy= +3 (UK), (11g) 
T.a=0, (11h). 
I= 1 [ - 2G (1) +46 #2) 4 7(* SH) — FAS) 


eae (In(K—€,))*— 1 (in( R$) In2K |, (111) 
2 


K Be er SPE fe 
1, =F | _9(p—a) + — 4601) +4G( 2h )+F( 1) —F (x) 
4 F(m) —F (%)) 4 -a?-+a In =| (11) 
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where 
pa é ae x” ye gent +1 
<a HA F = SI a .' G(x e 
cs ie Pe «) n=l n a, oy i 
sat, gee Koa, 5 cckd See X= bat : 
To Kose a 2 5 K-—<s, Kis, Koee, 


The suffixes a, b,::---- , j correspond to each integral region in Fig. 1. 6 in (11a) and 
(11c) is the small interval of the integration over €’ at €’=¢, and it is introduced for 
the sake of the practical convenience to the calculation. For the integrations in the 
regions where &/>¢,,, we have used assumption I. By using (4), (8) and (11), (3) 


is now reduced to 


f(K) =f(0) +L,4C0-9 


-6 SH . 


al \ ae, In 


1 5 K—é, me od i 
pee 4G( “\_F, PAAR, Pelle ee a’ | K), 12 
+| a = tekst Be Harland f(K) (12) 
~where 


Here we used the fact that G(1)=7°/8. There is no singularity in (12) at K=€,, 
‘since the terms containing 7 have all cancelled out in (12). 
It should be noted that (12) must be reduced to the identity 


f(K) =f(K), (13) 
if we make ©, vanish, because it means that the real part Re(D*(¢) —D*(0)) is sub- 
stituted by (5) over all the energy regions. This requirement is indeed satisfied in (12). 
It must be emphasized that it is necessary to partially substitute (5) in order to obtain 
a physically meaningful result. 
Two integrals in (12) need the cross sections above ~ 15, say Sy, which corresponds 


to about 2 Bev of the meson kinetic energy in the laboratory system. Then, in the 
energy range $y <¢<¢), we put 


fOagte “=| FE a) FEW) E+ 8 fla) —SaeflE |, (14) 


according to assumption II. Applying (14) to the relevant integrals in (12), we gain 
the final results, 


f(K) =f(0) +,+C— (H,+H,+H,4+)—D 
+ {1— (4+B)}f(K), (15) 
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where 
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and F(x;) is abbreviated as F,. 

The first four terms in (15) are completely determined by the experimental data 
available today. A, B and D are the high energy contributions and depend on our as- 
sumptions I and I]. B and D represent the contribution from the region fo oS, 
and especially depend on assumption II. Fortunately, these three are, anyhow, very small 
compared with the other low energy contributions. Then, our conclusions in the next 


section are fairly insensitive to the assumptions. 


§ 3. Numerical results and discussions 


The first four terms of (15) can be obtained from the empirical data. f(0), the 
sum of the cross sections o* and o~ at the threshold, is determined from the S-wave 
scattering length by Orear,” which gives 


f(0) =0.42. (17) 


C, the coupling constant term, does not depend on the energy K, which is chosen as 


as 100~©o in the present paper. The result is 
sal 2997 6? S — 0:24, if f?=0.08. (18) 
which involves the real parts of the scattering amplitudes, the 


To the integral [,, 
(3, 3) P-wave amplitudes contribute mainly, since the most of the S-wave amplitudes are 
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subtracted in the integrand and the other P-wave amplitudes are small in magnitude and, 
further, nearly cancel each other. The result is fairly independent of the choice of K 


and gives [i= 6.29. (19) 


Anderson’s empirical formulas presented at the 6th Rochester Conference” are used to 
obtain (19). The value of [, is not insensitive to the data at very low a = 
Fig. 2), where there are certain empirical ambiguities. However, the oa in this 
paper are not upset, unless I, reduces by about 30 percent from the value given in (19) 
(see Table 1). As is seen in Fig. 2, this is not to be expected. 


Fig. 2. The integrand of J, 


The solid curve is obtained from Anderson’s empirical formulas, and contains the 
contributions from the small P-wave phase shifts and S-waves. The broken curve 
is calculated from Chew-Low’s static theory, using the formula by G. Puppi et al.” 
The experimental plots contain only the (3, 3) P-wave phase shift. 


H,, H,, H, and /” are calculated using the empirical total cross sections below 
~2 Bev.” Each of these and the sum J are given in Table 1 with several choices 
of §,, and K. The errors are expected to be less than a few percent as a whole. 

A, B and D represent the contributions from high energies (>¢y). These values 
depend on the choice of ¢,,, but fairly irrelative to the value of K. The magnitudes 


are, however, very small compared with the quantities relating to the low energy data. 
The values of these are also given in Table 1. 


Rewriting (15), we see that 
f(K) = {(f(0) +14+C) — (d+D)} / (A+B). (20) 


Since (A+B) is a very small positive quantity, we must have a very small positive 
difference, {(f(0) +1,4+C) —(4+D)}, (21) 


to obtain a reasonable value for f(K), which would be expected to be about 3 (60 mb). 


= ye 
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Table 1. The numerical values of the quantities in (15) and (16) with several choices of En 
and K. &) and &y are fixed to 3 and 15, respectively. 


Case | Ey K A B A+B H, | ; Hy 
a 15 200 0.0815 | =: [0.0813 | 276 ae 
b 15 0.0814 ue | 0.0814 BS LD 
c 100 100 0.0122 0.0149 | 0.0271 | ~=—-2.76 1.48 
d- 100 200 via} 300122 wile 0.0153 0.0275 | 2.76 1.34 
e / 100 0.0122 G015>  ) @.0a77 | 2.75 1.27 


——_—_—— reese 
a 


Az if Paes ier | D fotI4+C—(4+D) f(K) («+ +07) mb 
0.00 0.35 4.45 = | 1.98 24.3 dia; 486 
0.00 0.35 4.37 = 2.06 25.3 506 
0.00 0.35 4.59 0.166 1.67 61.5 1230 
0.00 0.35 4.45 0.165 1.81 65.8 1320 
0.00 0.35 4.37 0.164 | 1.90 68.6 1370 


Unfortunately, it is not the case. In the case ¢,==15, where there is no region to apply 
assumption II, we obtain about 500 mb as f(K). This is far larger than 60 mb and 
two times as large as the corresponding geometrical value, which would be obtained if 
the radius of the nucleon were determined by the meson Compton wavelength. If we 
choose ¢,, as 100, we have about 1300 mb. The discrepancy is increasing much more 
in this case. Thus we may conclude that at least one of the fundamental assumptions, 
under which the dispersion relation holds, might be discarded at very high energies, if 


the experiments are consistent with the dispersion relations at low energy region. 
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Note added in proof The present method is not so useful to obtain the precise value of the total cross 
section at infinite energy, because the value of small factor A+B depends sensitively on the assumptions 
made, i.e., on the choice of &. It is preferable to say that the extremely large value of o++o7 at very high 
energies means conversely that the real part of the even amplitude below 300Mev is not expressed by the 
integral of the cross sections if s+-+o~ is kept nearly 60mb at high energies. This just corresponds to Puppi’s 
discrepancy in x~--p reaction. In the following paper we shall investigate in detail in what extent the above 
result is affected by the inaccuracy in the experimental data. 
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The main interest of the present paper is in analysing the fluctuations in the starting points of 
extensive air showers in the atmosphere. A method of obtaining the shower curve without taking 
into account the effect of the fluctuations is presented. The latter effect results in that small energy 
showers which thrust themselves into the lower atmosphere are observed as if they were large energy 
showers, and the effect is not insignificant especially in the case of observing showers of large sizes 
at low altitudes. This may also modify the primary energy spectrum in such a way that the intensity 
at very high energies is significantly smaller than the one currently adopted. Qualitative interpreta- 
tions of some unusual events and some speculations on high energy nuclear interactions are also given. 


§ 1. Introduction and summary 


Up to date, various experiments on extensive air showers have been performed by many 
authors””. It has been clear that extensive air showers are generated by primary cosmic 
rays of very high energies. Most of the primaries are protons and the rest are alpha 
particles and heavier nuclei. Their energies are estimated as high as about 10'°—10"eV, 
and one can obtain some information about such high energy nuclear events only from 
extensive air showers. An extensive air shower spreads over a wide area whose diameter 
is as large as several hundreds of meters, and, consequently, this fact facilitates us to 
observe such high energy events in spite of the very small frequency of the primaries. The 
information from extensive air showers is important not only for investigating high energy 
phenomena, but also for studying the mechanism of the retention and acceleration of the 
particles in our galaxy, through the primary energy spectrum and the relative abundances 
of heavy nuclei in primary particles. However, since the observation of extensive air 
showers is not a direct measurement of the high energy events but a partial observation 
of the. great number of particles that have been produced as a result of such high energy 
events, necessary information is obtained only through the use of statistical analyses. 

Recent experimental researches!” have shown many phenomenological features of ex- 
tensive air showers, viz. the size-frequency distribution, the zenith angle distribution, the 
absorption coefficient in the atmosphere, the lateral distribution of the electron-photon 
‘component, /4 mesons and the nucleon component at sea level and mountain altitudes, 
and so on. And the structure of the core of extensive air showers is also being studied 
at the present time. On the other hand, theoretical calculations» have been performed 
about the structure of extensive air showers by applying one of the models of multiple 
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meson production and electro-magnetic cascade theory. The comparison between such ex- 


perimental and theoretical results seems to be the only way of studying high energy 
events. 


Although it becomes clear that the extensive air shower is understood as the com- 
bination of a high energy nucleon cascade and an electromagnetic cascade which is derived 
from the decay of 7° mesons, a unique model that can consistently explain all kinds of 
the experimental data has not been obtained yet. Moreover, the complexity of the charac- 
teristics of the events is superposed on the above difficulty. It must also be mentioned 
that the results of theoretical calculations so far obtained do not always correspond directly 
to the observations. Most of theoretical calculations are those of statistical averages over 
the fluctuations due mainly to those in the atmospheric depth of the first nuclear collision 
and those further averaged over the primary energy spectrum. However, one can hardly 
know their primary energies of two such showers with an identical size that are produced 
possibly from primaries of different energies, if their starting points are different. Therefore, 


one must be very cautious in comparing the observed data directly with the calculated 
value. 


The purpose of our analysis is to find a clue to studying high energy events as well 
as to infer the primary energy spectrum important for the origin of cosmic radiation. It 
is important for this purpose to draw a basic picture from an experimental point of view,, 
not relying only upon theory. 

Our main interest is in the fluctuations as has also been pointed out by Kraushaar). 
Their effects have been studied under probable assumptions on the shower curve; the 
effects are not at all negligible but, rather, they explain some of the events that have 
been interpreted as unusual. The characteristic features of extensive air showers without 
fluctuations in starting points are described on a basis of a different point of view from 
those so far adopted. 


§2. Fundamental equation for the observed size distribution 


The longitudinal development of an air shower is expressed in terms of the variation 
of shower size (the total number of particles) with the atmospheric depth. The relation 
is called “‘ shower curve”. A usual shower curve shows an average behaviour of shower 
phenomena caused by primary particles of a fixed energy, but we shall discuss here a 
somewhat different implication of the shower curve. 

In an air shower, the first stage is a multiple production of secondaries by a collision 
of a primary cosmic ray particle against an air nucleus and subsequent events are complicat- 
ed nucleon cascades and accompanied electromagnetic cascades. The size of a shower, once 
it grows up, does not fluctuate so much on account of the great number of shower 
particles, but this is not the case in early stages, because the statistical fluctuations in inter- 
acting depths are large. Although the fluctuations in later stages might not be negligible, 
if the inelasticity in a high energy nuclear interaction is small, we may safely neglect 
their effects, so that we may be allowed to deal only with a fluctuation in the starting 


points of air showers separately from those in shower developments. After obtaining the. 
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shower curves starting from varying depths, they are summed up with appropriate ——— 
weights. Thus we obtain a shower curve expressed in terms of simple parameters which 
are rather directly related to fundamental quantities. 

Since the air shower is so complicated that the precise presentation of the shower 
curve is difficult, the following assumptions are used in the first approximation. 

(a) It can be said in the first approximation that the development of an air shower 
is a function of the air mass through which the air shower has passed after many generations 
of cascades. Furthermore, the shower curve is a function of the primary energy. 

(b) A characteristics of the shower curve is shown as such that the shower size 
gradually increases with atmospheric depth up to its maximum size and then decreases 
gradually, but it does not appreciably wiggle. Since, fortunately, the mean free path of 
nuclear interaction does not much differ from the radiation length of the electromagnetic 
cascade, the above approximation will be permitted as far as the inelasticity is not too 
small. 

(c) Among the showers which have been generated at a fixed atmospheric depth, 
a shower by the primary of a larger energy has a larger size than that by the primary 
of a smaller energy at any depth lower than the generating depth. 

(d) The primary cosmic radiation contains protons, alpha particles and heavier 
nuclei, but the assumptions used here are independent of the nature of these particles ; 
the above properties of the shower curve depends only on the primary energy. Of course 
the detailed shape of the shower curve depends on the nature of the primaries, but this 
problem will not be discussed in this paper. 

In usual. observations, one observes the frequency of air showers which have larger 
sizes than N at an atmospheric depth X. It can be thought that the observed frequency 
corresponds to the primary intensity in the following way ; since the previous assumptions 
show that the shower curve is a function of the primary energy E and the depth X, we 
shall use a threshhold C, and an observable range C, as the representatives of a shower 
curve, namely, C, is the path length necessary to develop the shower up to size N, and 
C, is the range in which the shower is detectable as that with a size exceeding N by 


usual observations. Then both C, and C, are functions of the primary energy and the 
size N: 


C,(E, iN}; C,(E, N). 


For the showers whose size is just N at depth X, two limiting cases can be found: one 
is the shower which starts at the top of the atmosphere and the other is that whose 
maximum size is equal to N and its position is at depth X. The primary energies of 
the former and the latter are mentioned as E, and E, respectively. (see Fig. 1) 

The primary cosmic ray has a mean free path L in the atmosphere in order to 


generate an air shower effectively, where by ‘effectively? we mean that the air shower 


under consideration gives observable effects. L may vary as a function of energy, but, 


since the variation is thought to be small, L is assumed as constant in the begining. 
According to the above definitions, the frequency of the showers whose sizes are larger 


sp oe tell 
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than N at depth X is irrelevant to the intensity of the primaries of energies lower than 
E, but only relevant to the primaries of energies between E, and E, and larger than E,. 
A shower from a primary energy between E, and E, is observable when it is started at 


Shower size 
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Fig. 1. Shower curves of two limiting cases (Ey & E2) 


the depth deeper than X—C,—C,, and becomes unobservable after the depth X—C,. 
The probability of detecting such showers is expressed as ; 
X—C,—C, » ( Cone 
I iS ey id Og Ore Cee 


The showers initiated by primary energies larger than E, are observable when they 
are started at depths smaller than X—C,, and the probability of detecting them is 


r= exp( 4 ao) 


Then the frequency for showers of sizes larger than N is expressed as follows : 


—(X-—CQ—C2)/L 


Ea —C2/L © mie Cy) 
R(>N)=|_ é(E)e (ages © )dE+\" 6) ee” a ane 


where g(E)dE is a differential energy spectrum of primary cosmic radiation. 
Quantitative values of the first and the second terms of the right hand side of the 
above equation must be determined by experiments. We cannot immediately determine 
the magnitudes of the terms in eq. (1) for the reasons that $(E) could be a complicated 
function of energy ; neither the shower curve nor the ratio of E, to E, is exactly known. 


The analysis concerning this problem is described in the next section. 
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§ 3. Attenuation of the shower size through the atmosphere 


Among the experimental results so far obtained, the attenuation of the shower size 
through the atmosphere has not been fully understood ; although a shower of larger size 
might be regarded to decline more slowly at first sight, the results so far obtained show 
nearly a constant value of the attenuation length independent of the shower size or a some- 
what shorter attenuation length seems to be observed for showers of larger sizes. This 
seems to be due primarily to the method of estimating the attenuation length. The 
procedure currently used is such that, first, R(> N) is observed as a function of atmos- 
pheric depth (sometimes the effect due to the variation of atmospheric pressure or the 
zenith angle dependence may be used) and then the absorption coefhicient is obtained 


from the next relation : 
— p= —1/h,=8 In R/OX (2) 


where 4,, is the absorption (the apparent attenuation) length. Since the observed size 
distribution is approximately expressed by a power law, if a shower size can be expressed 


as 

Nox BED, (3) 
the counting rate is given by 

R(>N) =A NO =e Ne. (4) 

From eq. (2) and eq. (4) one obtains 

A,=A, 7. (5) 
However, there is a question concerning the use of eq. (3) ; namely we can hardly accept 
such a simple assumption as eq. (3) from the reason implied in eq. (1). In other 
words, A obtained by eq. (5) represents a complicated quantity which can not always be 
understood simply as the attenuation length, on account of the fluctuation in the starting 
points of air showers and the deviation of the shower curve from that expected from 
this simplified model. 


Let us try to apply a method of obtaining an attenuation length similar to what 
was used for eq. (1). Introducing the following notations for brevity, 


F a Jr 6(E) ae -C2)/1 ( se ania (6) 
Fe=\ d(E) dE, (7) 


aoe remembering that C, and C, are functions of E and N, E, is a function of N ee 
is a function of N and X, and L is a constant, we have 
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1 OF Oke wubb evil 
ee pet b 2 lea atitbeg 7 2 Wes ON 
BEE COPI RR heme L: ~ “OEp OX OE, oie ") 
doy =: ax F+F, ‘ (9) 


According to the definition of E,(C,(E,, N)+C,(E,, N) =X) 
OF, MOE, me OF, WO: 


= 10) 
@E, OX OE, aX Mee 
then eq. (9) is reduced to 
; EF. 
/ Cp) be op be 2 5 11); 
E+E (11) 
By using r defined by 
F; 
pe Lite 12) 
Fic, — FP, 
eq. (11) is expressed as 
A~=L+Lr. (13) 


If exp|—(X—C,)/L] is sufficiently small as is expected in the deep atmosphere, F, 
becomes very large compared with F,/—F, and r is approximately equal to F,/F;. Hence 
eq. (13) shows that /,, does not reveal direct information on the attenuation length ; to 
be more exact, the meaning of /,, is a quantity which implies both of the mean free 
path of the primary particles and the penetration of low energy primaries into the atmos- 
phere. The attenuation of the showers is not the one assumed in eq. (3) but it is 
related to the shower curve in a different way from the current interpretation, as will be 


described in § 5. 


$4. The energy spectrum of the primary cosmic radiation 


The energy spectrum of the primary cosmic radiation can be obtained from the observ- 
ed size-frequency distribution by means of a suitable process of transformation. The 
method of transformation so far used is such that the mean value of the primary energy 
range, which has been considered to be narrow, is obtained from an observed size and 
the frequency with this size gives the energy vs intensity relation of the primary cosmic 
radiation. 

Here we will try to derive the primary energy spectrum on the basis of clearer de- 
finitions. First, E, in eq. (1) was used as the energy which corresponds to the size N 
owing to the one to one correspondence between energy and size. The value of E, can 
be obtained from the shower curve which will be analysed in § 5; it consists of the 
energy consumed by the electromagnetic component, which is obtained as a product of the 
total track length and the critical energy in the air, and the energies of the nucleon com- 
ponent, the /4 meson component and the neutrino component which are properly estimated. 

As E, is a function of X and N, we wish to obtain the relation between E, and 


‘850 


N at fixed X, but the observed data are not sufficient for this purpose. 
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A theoretical 


‘estimate shows that N is proportional to E* where @ is closed to unity (about 1.1— 


aye 


In this section, therefore, the energy spectrum is studied under the assumption 


that E, is proportional to N. A modification of this assumption will be described in a 


later section. 
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The integral frequency at E, can be obtained with the aid of the integral frequency 
at size N, the observed absorption length and the mean free path L; namely, by eq. (6), 
(7), (8) and (12) the frequency ¢(E)dE is equal to R(>N)r/(1+1r). Since the 
value of L is not observed yet, two values of L, 80 and 90 g/cm’, are adopted as is 
shown in Fig, 2. 

Fig. 2 shows that the intensity of the primary cosmic radiation in a high energy 
region is quite small in comparison with that adopted in the past. For example, at the 
size of 3X10" (the energy is about 3X10’ eV) where 4, is 90 g/cm’, the integral 
frequency at this energy is negligibly small for L=90 g/cm” and it is about one-tenth 
of the currently adopted intensity for L=80 g/cm’. 

The observed values of /,, are shown in Fig. 3°; it seems to converge to 90 g/cm’ 
as the size increases. If the primaries had an upper limit in their energies, L would be 
at least 90 g/cm® in the energy region concerned. 

The estimates of the primary intensity and L are mutually dependent on one another ; 
moreover, there is a possibility that the relative abundances of the components in the 
primary radiation vary as the energy increases. 

It is desirable to carry out direct measurements of L, which are probably possible by 
means of the observation of absorption of the showers which have an equal size and an equal 
shower age at every depth. Another important experiment is the size distribution near 
the shower maximum, because it approximately reflects the primary energy spectrum. 

Since such observations have not been performed yet, we can only conclude, accord- 
ing to the previous simple assumptions, that the primary intensity decreases rapidly in a 
high energy region and their practical upper limit is at about 5 X 10%eV. This conclu- 


sion is in sharp contrast to that adopted in the past. 


§5. Shower curve 


Observed data available at present are not sufficient to derive the shower curve con- 
clusively. Hence we shall describe only the outline of obtaining the shower curve in 
this section. The altitude variation of the counting rate has been obtained’) for a fixed 
density of shower particles (~50 /m?) and it has also been known already that the rate 
varies with the density proportional to J~'”. From these facts the shower curve can be 
obtained by the following way : 

(a) First, the observed altitude-frequency curve is reduced to that of the vertical 
component owing to the relation which has been used by Kraybill and Greisen’’ as follows, 


F(X) =<") [n+ 1-x2e | (14) 


where F(X) is the integral frequency of the vertical component of the showers which 
have densities larger than the fixed density 4, C(X) is the observed frequency, and n 


is a constant which, in the usual case of using cylindrical counters, is given by 


n=1.8(7—1) + 0.357—0.2 (71,5) 
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where 7 is an exponent in the density distribution. 

(b) Transformation of the rate F(>4) to the rate R( >N) 

Assuming that the lateral distribution of the shower particles is inversely proportional 
to air density and the structure is independent of the atmospheric depth, we express ‘the 
density at x from the shower axis as 

d= Nf (x) /x, (15) 
where x, is a scattering unit. Since the size distribution has a power dependence on N: 


RU > Ny Sea s, 


we have 


F(>4) =|"20 xdx Ry(x24/f(x))=* 


= Dare eee ee R, | (F(x) *xdx. (16) 


Accordingly, 
F(> 4) =F,d-*00x, 2-9 4-7 R,, 


As x, is inversely proportional to air density and the air density is empiricall ro- 
1 Y propo P yo 
ortional to X°°, the relation of the transformation is as follows : 

P > 


F,(X) oc R(X) X™ME-Y. (17) 


(c) Correction to the effect of the fluctuations 

One can obtain /,, from the gradient of the curve R(>N) versus X at every point 
of depths and also the value of r by means of eq. (13). Then we can estimate the 
integral frequency at E, by multiplying R(>N) by r/(1+r). 

The first term of eq. (1) is absent at altitudes higher than the depth at which E,=E, 
(this condition takes place at altitudes higher than the depth at which the observed count- 
ing rate is maximum). Hence eq. (1) in this region is reduced to 


R(>N)= es g(E) ge Ih 6 (E) e- 8-00 /2gE. (18) 


Let us substitute F, and F, respectively in the first and the second terms in the right 
hand side of the above equation then 


OF, Oa 
_ OF, 2, 1p, OF 8, 
1 dInR_ 98, OX L OE, OX 
Asp = Ox ee r= F, ce i (19) 
Since X=C, at the energy of E,, 
5 F,—F. 
Ag=L2—, (20) 
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Putting : 
r=F,/F,=L/U,+L), (21) 
we have 
|" §(E)dE=F,=R(>N)—1_. (22) 
J E2 1— r! 
100 
fe (c) 
[ 
Ee ) 
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C 
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Atmospheric depth (g/cm?) 
Fig. 4. Frequency-depth relation in §5, (a) (b) and (c) 


(d) Transformation of the frequency distribution into the shower curve 

The results of the above paragraphs are shown in Fig. 4 as curves (a), (b) and 
(c) (these are normalized at the depth of 1 kg/cm’ in Fig. 4). 

If we have a group of curves which are similar to (c) for every value of N, we 
can obtain the relation between X and N under the condition that the integral frequency 


for a fixed energy must be a constant at every depth; namely, the condition means 


(3 (E) dE=const 


J B2(X,N) 
and the shower cutve for E, indicates the relation between X and N from the above 


equation. 
Unfortunately, the curve is now known only in a narrow range around N=10" so 
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that a shower curve is obtained by assuming the size distribution N-'°. The curve is 
shown in Fig. 5. The position of the maximum is moved step = step by the processes 
of (a), (b) and (c) and finally it is estimated as about 300 g/cm’, but its value may 
be somewhat lower for the primary energy of giving the above shower curve because E, 
at the maximum of the frequency-altitude curve is somewhat lower than the energy under 


consideration. 


Shower size 


10° 


10° 


1 1 st. 1 OOO EE 
0 100 200 300 400 S00 600 7H 800 200 1000 


Atmospheric depth 


Fig. 5. Shower curve for the energy~10"eV 


Le) 


6. Discussions 


We have pointed out that the treatment of the fluctuations of the air shower develop- 
ment gives a more detailed approximation in the analysis of shower phenomena, but there 
are so many unknown quantities that a definite conclusion cannot be obtained. When 
the assumptions used in the previous sections for the characteristics of the air shower 
were varied, previous discussions would be modified as follows : 

(1a) In section 4, the energy spectrum has been obtained under the assumption that the 
ptimary energy is proportional to the size at sea level. But the size is thought to be 
proportional to the power of the primary energy with an exponent which is somewhat 
larger than unity ; accordingly, the tendency that the energy spectrum rapidly decreases 
in a very high energy region is more strengthened. 

(1b) In section 2, the shower curve is assumed to be independent of the altitude of 
its generation. But in the lower atmosphere, since energetic unstable particles cannot decay 
owing to the successive collisions with air nuclei, the proportion of the energy which is 


spent as electromagnetic components becomes somewhat larger and the effect of the fluc- 
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tuations discussed in the present paper is thought to be greater. 

(1c) As described in the last part of section 4, when the value of L and the composi- 
tion of the primary cosmic radiation, or more generally their characteristics, vary with 
their energy, the conclusion must be changed. However, this will bring about a very 
slight modification such that if L varies with energy in a similar manner to that given 
in Fig. 3 and the primary spectrum is similar to the one currently adopted, their absolute 
values are somewhat smaller. 

The effect of the fluctuations is taken into consideration to explain those phenomena 
which are hardly understood in terms of the averaged behaviour of air showers. For 
example, 

(2a) The fact that the very energetic nucleon component has been observed” occasionally 
in the core region of the showers will be explained from the reason that these showers 
are young; namely, they are showers started at the lower atmosphere. 

(2b) The high density 4 meson showers observed underground’ is also explainable’. 
If such a crude approximation is made that #2 mesons are generated from the center of 
the shower with a fixed angular distribution by a primary particle then the shower which 
started at the place as low as one-tenth of the average starting altitude will give a large 
density, about 100 times the usual one. 

The statistics of the observation is not yet good enough for the quantitative explana- 
tion of such special cases. 

Lastly, some of the speculations concerning high energy events will be presented. Of 
course, it is insufficient to discuss this problem referring only to the shower curve, but 
one would have to consult with future measurements of the structure of the shower by 
means of various methods. 

(3a) The inelasticity at a collision of a high energy nucleon component against an air 
nucleus seems to be 0.5—0.7 from the comparison of the shower curve obtained here 
with the result of a calculation. 

(3b) If the largest observed size was equal to a size at the maximum of the showers 
of an energy near the upper limit of the primary energies, the ratio of the size at the 
maximum to that at sea level is about 20 to 30. The ratio is nearly equal to that 
for showers of the primary energy of about 10”eV whose shower curve is obtained in this 
paper. On the other hand, the attenuation length is thought to be not indifferentiable 
according to the primary energy. Then it may be concluded that the energy of 7° mesons 
which conducts the shower curve does not vary according to the primary energy, and as 
the energy increases more and more energy goes to heavy mesons and nucleons, etc., than 
to 7 mesons. 

The author wishes to express his sincere thanks to Professor S. Hayakawa of Kyoto. 


University for his kind encouragement and helpful discussions. 
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Scattering problem for finite time interval is discussed by using wave-packet-formalism. A pre- 
scription is given which allows us to deal with the observations of different wave packets in an 
antinomic way. The unstable particle is investigated in connection with the transition probability in 
the frame of the wave-packet-representation. 


$1. Introduction 


For a recent few years many papers’? have been published concerning the unitarity 
of the S-matrix, the renormalization of physical constants, and so on, for the system 
involving unstable particles. It seems to us quite unsatisfactory, however, that in these 
papers the unstable character has been formally taken into. account, and authors have not 
touched upon more essential problems such as how to represent a state of unstable particles 
or how to recognize that an unstable particle actually exists for a finite time interval 
when we take into consideration the production of unstable particles from stable ones. 
The difficulties in answering the above problems come mainly from the fact that the 
unstable state is not the eigenstate of the total Hamiltonian. For instance, Glaser and 
Kallen” defined the unstable state in terms of the approximate eigenstate of the total 
Hamiltonian. The present author” obtained a similar definition by studying the produc- 
tion and decay of an unstable particle in the stationary treatment. Recently, Nakanishi” 
introduced a mathematical notion, complex distribution, and defined the unstable state as 
one which is nearly identical with an exact eigenstate of the total Hamiltonian with a 
complex eigenvalue and reached a somewhat different conclusion from the former. Then, 
he” confirmed his own conclusion by making use of the averaging procedure” for a finite 
time interval. 

In these papers the antinomic character of observations of unstable and those of 
other states was out of consideration. This character, which is automatically satisfied in 


the case of stable particles, is also an essential one in the case of unstable particles. This 


character is usually guaranteed by these two conditions, 


DhHi) Gi =1 (it) 


and j= 0, (1-2) 


* Formerly, Kunio NAITO. 
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that is, realized by choosing a set of states {|i)} forming a complete orthonormal set. 
The condition (1-1) allows us to interpret |(i{D)|? as representing the probability for 
finding antinomic |i) in the state |#). (In the theory of probability such a quantity 
as |(i|M)|? is called a “‘ probability distribution ”’.) 

The main aim of this paper is to give a prescription for a system involving unstable 
particles by taking account of the antinomic character mentioned above on the basis of 
the theory of scattering of wave packet.” Consideration on antinomy will enable us 
to understand the unstable states more clearly and intuitively. As a preliminary discus- 
sion, we shall give in § 2 representations of wave packets to interacting particles. In § 3, 
by using these representations some super-complete set of state vectors will be introduced 
and the antinomy of these states will be discussed. In the final section, the unstable 


state will be defined and investigated in connection with the transition probability. 


§ 2. Wave-packet-representation 


We shall begin with the discussion of the wave packet for the two free particles 4 
and B. The wave packet for the system of 4 and B having the average momenta p, 
and p, and the average positions r, and r, respectively at some instant, say, t=O, is 
given by 
| dle dle F, (Pi, 115 ey) Fe(Pes Pes he) ss Be) (2-1) 


where E.(p, Tq, k)=e"*-P Fk, p). 1, 2 (2-2) 


is responsible for the shape of the packet and |k,, k,) is an eigenvector of the free 
Hamiltonian for the particles 4 and B having momenta k, and ky, respectively. We as- 


sume that f,(k, p) is zero except for k~p, and we also assume the normalization 
| ah fick, p)|(?=1 i=1, 2. (2-3) 


The wave function of the ordinary space is 


(27) - tp | dhe F.C, P). (2:4) 


Denoting the uncertainty of the momentum and position by Jp and Jr respectively, we 
have, of course, dp-dr~1 by uncertainty principle. 

We can classify the wave packets of two particles into the following four classes 
corresponding to relative momentum and position of two particles. 


Suppose a wave packet which represents a state of particles A and B havi 


fic} 
large distance |r,—r, ng a sufficiently 


> |dr| at t=0. If A and B are approaching to or departing from 
each other at this instant, then this packet is defined to belong to the class a or b res- 
pectively. If the above 4 and B will pass or have passed by each other in some instant 
but never come closely, namely their distance can never be smaller than dr, then this 
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packet is classified into the class c. Finally if A is present in the vicinity of B with the 
distance |r,—r,|<|dr|, this belongs to the class d. Here |4r| has been defined by 


[4r}==Maxi|4r7), |4r.|}, at t=0. 


It will be seen that this classification will become ambiguous owing to the increase of 
|dr|, for instance, when the wave packet will be deformed before arriving at their closest 
position. But such a packet need not be considered, as we shall see later, since this. 
packet gives no contribution to the calculation of transition probability. 

When two particles A and B are interacting with each other, the most straightforward 
way to obtain the statevector corresponding to (2-1) is to replace |k,, hk.) in (2-1) 
with an eigenvector of the total Hamiltonian describing a state of two physical particles 
with momenta k, and k, but no mutual interaction, but it is quite evident that such an 
eigenstate cannot exist because of [p;, H|0. In the case of |r,—r,|>|4r|, we can 
consider a wave packet representing one particle state for each one in the very neighbour- 
hood of each particle, accordingly the statevector for the system can be given in some 
meaning as a product of quantities corresponding to such two packets. Let this state be 


denoted by 
Pi, T,; Pr Ts). (255) 


In addition, we define |p,, 7,3; Po, T2)* as follows : 
|Pi> T,; Pr» r.)*= | dhsdhsF (Py r,, ky) Fi(p2, Me, ky) |k,, ky)=. (2-6) 


Here |k,, k.>* or |k;, ky)” is the outgoing or incoming solution of the Schrodinger 
equation with the incident momenta k, and k,. Now we shall consider the quantity 
+k, kee\pis T13 Pe» Te) for a wave packet of the class a. For convenience sake, let 
the outgoing solution be separated into three parts, the incident plane wave, the outgoing 
spherical wave and the rest which vanishes rapidly for a large distance between colliding 
particles. The only contribution to *(hy, kolpi, 713 Po» To) in the packet of class a 
may be expected to come from the first part of the outgoing solution provided that |p,| 
and |r,—r,| ate sufficiently larger than |4p,| and |dr,|. Then it seems reasonable to: 


define that (see Appendix B) 
* Ck, ky| Pp, T,; Pr» r,)=F,(p,, r,, k,) F, (ps; To, ky) (2-7) 


for class a and that scalar products of IPi, T13 Po» To) with outgoing solutions other 

than |k,, k,)*, or with bound states are zero. The same argument can be performed 
; . . 

for incoming solutions and a packet of class b. Therefore, recalling that the outgoing 


, 
; : ; Ri 
or incoming solutions together with bound states form a complete set respectively, we get 


|p, Tr; P»2 Tr.) =|P1, Ti; P» foe (2-8) 


for a packet of class a and 
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|P.; T,; Po r.)=|P1> T,; Ps» ri) (2-9) 


for a packet of class b. 


§3. Orthogonality and completeness of wave packet 


In this section we shall find a set of the statevectors representing wave packets and 
satisfying the condition (1-1) which is necessary in order that the observations of the 
different states are antinomic. As an example of this set, we can take {/p,,-T,3 Ps; 
r,)~} ==© defined by (2-6). From the definition, we can easily see that 


(22)~*| dr,dr.dp, dp:\P, T,; Po Fay? Pia T,3; Pz, Te\ 
= | dpsdp.dhe ae fe, pL foCh 2)!" Ibin Ba) ~~Cliy al 


=| dhesdh| hy, hg) Ie) (3-1) 


Here, we have assumed that |f,(k, p)|® is a function of kK—p and have used the 
formalization of (2-3). Now for the sake of simplicity, we assume that {{k,, k,)~} 
forms a complete set. Since the extension to more complicated cases is straightforward, 
we shall restrict ourselves to this simple case. By virtue of the above assumption and 
(3-1), we see that © is one of the sets of the statevectors which satisfies the condition 
(1-1). As to the condition (1-2) for orthogonality, we have the following relations : 


=1 for p,=p,’, r=r, 
no ih Torre / = ry) " j j rt | 1 
(Pv; T, 5 Po» r,'|p,; T,3 Po, To) =4~1 for \P:— Pi | ~|4p,|, ljr;—r,y || 4r,| 
=0 otherwise. (3-2) 


Namely, in our case, the orthogonality between any couple of nearly identical statevectors 
does not hold and norm of each statevector is equal to unity in contrast with the 
customary formalism where the statevectors are used to be normalized to d-function. Owing 


to this peculiar situation © is to be called a “ super-complete set”. The fact that © 
satisfies 


cee dr, dr, dp, dp,|p,, T,; Pr, yeas ri; P., r,|=1 (3-3) 


Sa be roughly regarded as due to the device that the norm of each vector is contracted 
in such a way as to cancel out the super-completeness of ©. 
G . a — . ~~ 
: onsider a statevector |p,, r;; Po, Ts) belonging to ©. As already mentioned above, 
‘ ~! . ~ 
ey exists a subset ©’ of elements in @, any element of which is not orthogonal to 
k 
e above specified vector. Let us consider the following expression (3-4) for a state 


|®> 


(27)*| dp, dprdr,dra\-(p,, ri3 Pes Te|@)|* (3-4) 


D 


as ee 
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where the domain D of integration includes the above introduced subset ©’ and is 
sufficiently wider than it. In this case we can interpret (3-4) as the probability for 
finding in |) the states belonging to the domain D, while if D is smaller than ©’, 
namely it cannot cover ©’, this kind of interpretation is meaningless. For, in the present 


formalism, statevectors belonging to ©’ 


cannot be distinguished from each other from the 
observational point of view. In this restricted sense, the quantity |~(p,, 7); Ps, T2|%)|* 


can be called the probability distribution. 


§ 4. Transition matrix and unstable states 


In this section, we shall investigate the temporal variation of the wave packet in 
order to understand what the wave packet |p,, r,; P», T2)~ represents and to get the 
expression for transition probabilities. Let us consider a state of the wave packet of the 
class a at t=0. This state will become 


e*™\p,, T,; P2> ra)’ = | dled. (Pr, T1, k,) F, (pr, r,, k,) 
Xexp{ —i(E4(Iy) +En (he) )t} |i, be)* (4-1) 


at a time t, where E;(k) = 1/k?+m2 is the energy of the physical particle i with the 
mass m,(i=A, B). 
We can rewrite (4-1) as follows, 


| dh dleP (Bis r+ VaCP) )Fe(Pes Pet Vinee) hy) 
x exp{—i(E4 (hy) — hey Vapi) + Pr Va (ps) +E 2 (he) 
— key: Vn (Ps) + Po Vn(p.))*h lis Ie)*, (4-2) 


where V,(p)=p/E,(p). It is easily seen that the exponential factor in (4292) iis 
nearly constant for the variation of k’s, since the partial derivatives of these factors with 
respect to k’s are nearly equal to zero at k~p and F vanishes for k Ap. In general, 
this exponential factor gives rise to some deformation of wave packets while traveling, 
but we shall be concerned with the case where this kind of deformation can be neglected. 
(Low and Chew” showed for the actual elementary particles that except for very slowly moving 
electrons this deformation is small enough to make the wave-packet-formalism possible for 
the traveling distance 100 cm.) From (4-2) we see that the wave packets move freely 
and their velocities are V;(p). Since the wave packet (4-1) for t=0 belongs to the 
class a, there exists a positive time T for which 
r,t+V.4(p)T=r.+ V2 (p,) L. (4-3) 

As the arguments of F’s for t>T in (4-2) indicate that these F’s belong to the class b, 
to such a statevector it seems difficult to give a simple picture as was pointed out in § 2. 


Therefore (4-1) or (4:2) represents a system of two physical particles having average 
momenta p, and p, and approaching to each other until the instant T. However, at T” 
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these two particles collide with each other, accordingly (4-1) or (4-2) can be interpreted 
as representing the state of this system after the collision for t>T. The above con- 
sideration gives us a way of interpreting such a complicated statevector. 


Let us define the transition matrix by 
=4P35 155, Pio Kile |\Pis Tres. Pes r,)* 
= | dle dhe dhy dhs F* (Ps, 3, Kz) F,*(py, Ta, hs) F, (Pi, 11, k,) F, (ps, T2, kz) 


 exp{—i(E.(Ihs) +En(k)) #} (Bs, alS|hs, Iz), (4-4) 


where we have used the definition of the so-called S-matrix (k,, k,|S|k,, k.)=~(k,, 
k,|k,, k.>*. The S-matrix can be decomposed into two terms. 


(ks, k,|S|k,, ky) =0 (k,—k,) 0 (ke,— ky) 


+0 (ks + k,— k,— ky) 0 (E 4 (kg) + Ex (ky) —E,(k;) —E,(k,)) (kh, k,|Rik,, k,). 
(4-5) 


As the first term describes the transition of two particles without mutual interaction, it 
is of little interest to us. 

We shall inquire into the condition for p,, r,, P,, T, for getting a non-vanishing 
transition probability. First, we shall examine the case where the phase of R-matrix is 
nearly constant for variation of k,; at the neighbourhood of the fixed p,;. In this case, 
the condition for p;, r3, Ps, T4 is that these must satisfy the following relations (4-6)— 
(4-8) within the error of dr; (see Appendix A). 


Ci ce te V « (ps) t= (V4 (p,) a V .(ps))T, (4-6) 
r4—Vr,— V2(p,.) t= (Va(p,) —Vn(p,))T, (4-7) 
r,—r,= (V4(p,) —Ve(p,) )T. (4-8) 


Here, T is the same one as that introduced before. Therefore in order that the collision 
takes place, t must be chosen larger than T. (4-6) and (4-7) for t>T show that 
the set of parameters (p;, Py, Ts, T,) corresponds to a packet of the class b, accordingly 
Pry pry represents a system of physical particles having momenta p, and p, 
and departing from each other. The transition probability for this process is given by 


(27)~*| dp,dp.drydra\ (pa, T3; Pas rajer*|p., Ti; Ps reps i> (4-9) 
dD 


In the second place, we shall treat the case where the R-matrix has a pole of the 
following type (R’=const.) : 


R=R'( (key+hy)?— (Eq (Kes) +E» (ky) y+ (M— r) yen (4-10) 


Let us assume that M>/", [’S |Jp,| and discuss the case 
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(pit Pp.)?— (E.4(p.) +E, (p,))?x (p3-+ ps)? — (E 4 (ps) +E, (py) )?~ MP, (4-11) 


since, otherwise the pole of R-matrix will not give any affection to the condition (4-6)— 
(4-8). In this case, (4-6) and (4-7) are to be replaced by (see Appendix A) 


r;—r,— V4(ps) (2 ati EPs payasvoduy s 


MI.’ Mr 
=(V.(p) —Vo(p,) )T, (4-12) 

noni Vn(p)( Pale + Fe 2PO g)— PP g 
=(Va(p.) —Va (py) )T (4-13) 


and (4-7) remains unchanged. Here a is an arbitrary positive number. The transition 
matrix (4-9) becomes proportional to exp(—a’) in a rough approximation (see Appendix 
A). Therefore a should be at most of the order of unity. Comparing (4-12) and 
(4-13) with (4-6) and (4-7), we see that the relations (4-12) and (4-13) can be 
satisfied for the set of parameters (p;, Py, 73, 74) belonging to the class b if and only 
if 
p> T +E 4lPs) FE a (Pd) ey, (4-14) 
MI’ 


In this case the statevector |p,, r;; Ps, 14) ‘epresents the same state of particles as 
mentioned before and the quantity |(p,+-p,)|@/(MI") can be regarded as the distance 
between P and Q as shown in Fig. 1. 


time PtP 
4 r+V Ap) T+ 


4 Fal part Eales a 


= space 


4 rtVa( py) T ji 


Fig. 1. Schematic representation of the relations (4-12) and (4-13) 


The aa interval between the world points P and Q can be interpreted as the “life 
time” during which some dubious state acutally exists. The dubious state can exist as 


an intermediate state with the probability being proportional to exp(—a). Therefore it 
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seems quite reasonable to interpret this intermediate state as representing an unstable particle 
whose velocity is (p,+p,) /(E4(p;) + Ex (ps)) and the mean life time is roughly equal 
tom / Le 


Next, let us consider the case of 


E ,(p;) +E »(p,) 
MI’ 


T+ Pe A eat bet (4-15) 
In this case the relations (4-12) and (4-13) can be satisfied only for the set of para- 
meters (p;, Pi, 73, 74) belonging to the class a or d. For such a set of parameters, 
the statevector |p,, 73; Ps, Ta) does not give us a simple picture of the system but 
we should rather interpret the quantity 


have \ dp,dp,dr,dr,|~ (ps, T33; Ps, r.je aed | 2 Ti; Pe, r.)|? (4-16) 


aadd 


as the probability for finding the unstable particle. Accordingly it may be feasible to 
define unstable state by 


\ dp,dp,dr.dr ,| ps, T;33 Pa, rs, Ph, T33 Ps; r| <P ls T,; Py, re)*. (447) 
aand d 


Now we can rewrite (4:17) as the integral 


oo 


Epinions ds, dh dhs, dav Ps er 


aand d 0 
re (Das rs, ky) Fo* (pa, T4,; k,) F,(p,, r,,k,) F;(p,, r., k,) 
x vevexp| —a—i(E4(I) +E 9 (hy) 
—i | Uy thd? (Exh) +E x(k) +P —T"L (MP) | (4-18) 


where the integral parameter a is nothing but the one appearing in (4-12) and (4-13) 
(see (A-6)). Since the integration with respect to p’s and r’s should be made over 
the classes a and d, the relation (4-15) should be satisfied, since otherwise the integrand 
vanishes. Therefore the domain of integration with respect to @ is actually reduced to 
(co, (¢—-T)MI'/(E4(p,) +Ex(p,)), from which we see that the expression (4-18) 
decreases exponentially with respect to t, as the function exp —i| (k,+k,)*— (E,(k) — 
E,(k,))?+M?—I"]a/(MP) is in practice nearly independent of @ (note (4-11)). 
As was discussed at the beginning of this section, the expression (4-1) is equal to 
[Pus Tit Va(p, es poy Pe V,(py)t>*, except the trivial constant factor, provided the 
deformation of wave packet is neglected. Therefore, (4-17) can be rewritten as 


| dp;dp,dr,.dr,|p,, T33 Ps, ri)-~ py T33 Ps, r,!P., ry P2> wy. (4-19) 


aand & 


- 
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Here, the relation of p,, r,’, Py, Ts’ must belong to the class b owing to the condition 
t>T. It is quite interesting to see that the decay spectrum can be represented by 
OG Od Fee Url Win eee P2, T:)*|? as shown by (4-19). 
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Appendix A 


In this appendix, we shall investigate the condition for p,, r,, P., T. and p;, rs, 
Ps, T, for getting a non-vanishing transition matrix. First, we shall consider the case 
where the variation of phase of R-matrix can be neglected. In this case, the variation 


of the phase of the integrand of (4-4) apart from f, with respect to k, is given by 
4=dk,-1r;+ 4k,-1r,— 4k, -1,— 4ky-r.— dk;-V4(p;)t—4k,-Vz(py)t. (A-1) 
Now, 4k,’s are restricted by the 0-functions for the total energy momentum.  Thes: 


restrictions are 

4k, + 4k,— dk,— dk,=0 (A-2) 
and Ak,- V4 (p.) + dk,-Vz (P») — Aks-V 4 (Ps) — 4k,-V, (p,) =0- (A 3) 
In (A-1)-(A-3), V,(k;) and V;,(k,) have been replaced with V4(p,;) and V,(p;) 
by virtue of f,;. By using (A-2), 4k, can be eliminated from (A-1) and (A-3),. 
namely, 


4=4k,- (r,—Ty— Vi, (p;)¢) + dhy: (yr V , (ps) #) — dk, - (r3—ry) (A-4) 


and 4k,- (V4 a) —V, (ps) ) + 4k, (Va (P.) aie (P,)) 

— dk, (V4(p,) —Ve(p2)) =0. (A-5) 
In order that (4:4) is not zero, J must be nearly equal to zero for the variation of 
dk, restricted by (A-5), when Jk, is of the order of Jp,;. Using the undeterminate 


multiplier T, we get (4-6)—(4:-8). 
Secondly, we shall consider the case where R-matrix has the form of (4-10). The 


expression (4-10) can be rewritten as 
“pr ; : ; 3 
ur \@ exp| —a—i{ (+h)? (E4(hs) +E _(ky))?-+ M0} a/ (MP) |. 


0 


1 
(A: 6), 


The variation Jd’ of the phase of the R-matrix is 
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pide 6 lee Ses 99 eee ee 
4 “|e + p,) - (4k, 


er] (E, (Ps) + E, (p,) ) (Via (ps) . 4k,+ Vy (p,) 7 ak) |. (A- 7) 
Adding this variation 4’ to (A-4), we can obtain (4-12) and (4-13) together with 
(4-8). The condition (4-11) is derived from the integration with respect to @. From 


(4-11) and (A-6), we see in a rough approximation that the transition matrix is pro- 


portional to exp(—a@). 


Appendix B 


Using the Lee model," we discuss the relationship of wave packets with outgoing 
or incoming solutions in more detail. When V-particle is unstable, the outgoing or in- 
coming solution of N—d@ scattering |N, 0,)* separately constitutes the complete set and 


is given by 
INt)*=(|daz*(q, b)a/*ys* +a*(k)$*))0), (B-1) 
+ Bs 
where 1*(q, k) =3(q—k) —g fed 2 ®) _, (B-2) 
V @, W,— Oo, FI1€ 
at (Ie) = FEW) 5+ Cy + ex) (B-3) 
V 20% 
aad SMEV=E— mateo | ax fies) (,—E+m,tic)".  (B-4) 
. Ox 


Since the bare N- or #-particle is identical with the physical N- or (-one, the state 
corresponding to (2:5) is 


dk F(p, r, k)a*(k)¢hy*|0). (B-5) 


Ip r= 


Here, r is the relative co-ordinate between N- and ¢-particles. Using (B-5), the 
quantity corresponding to (2-7) is 


*(NOy|p, 1) =F(p, r, k)—g\dqled. 9") pcp, rg). (B-6) 


V 2m, Wg—aztie 


The phase variation of the integrand of the second term of (B-6) is 


d= — dq. (r+ ¥(p) —— ~—_.). B-7 
| (@—a,)?> +e ( ) 
Therefore, in order that the second term of (B-6) is not zero, r and V(p) must have 
the same direction. In this argument, care must be taken, since the variation of the 
absolute value of the integrand is larger for «@,~w,. We can see, however, that this 
argument is correct owing to the fact that the sign of the variation of the absolute value 


Pate a or 
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for some specified dq depends on the sign of w,—w,. Hence, we can conclude that 


*(NO,|p, 1) =F (p, r, k) (B-8) 
except for the classes b and d, and 
-(NOx|p, 1) =F (p, 8, I) (B-9) 


except for the classes a and d. 
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Quantum Theory in Pseudo-Hilbert Space 


Gaku KONISI and Takesi OGIMOTO 


Department of Physics, Osaka University, Osaka 
(Received July 31, 1958) 


A method for treating the Hilbert space with an indefinite metric operator is given. Any 
transformation of state vectors and operators appearing in physics is generated by a “pseudo-unitary 
operator”. For such transformations two kinds of transformation properties, “covariant” and “con- 
travariant”, are considered, just like in the case of the Lorentz transformations in the Minkowski 
space. It is shown that the difficulty of the Lorentz non-invariance of Gupta’s theory in the quantum 
electrodynamics (reference 2) is due to the ignorance of such transformation properties. Applying 
our method to the electromagnetic field, we formulate a covariant quantum electrodynamics. 


§ 1. Transformation properties of state vectors 


Let us start with an ordinary Hilbert space. We introduce a metric operator £ 
such as 


f—1 


=€ and “€“ exist, (1-1) 


So 


where ¢* is the Hermitian conjugate of ¢. S°=1 need not necessarily hold. 
As usual, the states of a physical system are represented by vectors of the Hilbert 
space, whereas we postulate that the probability should be given by 


{x|$|x). (1-2) 


< 


Since the sign of this “ pseudo-norm”’ (1-2) is, in general, indefinite, in order that it 
may be interpreted as probability, we must put a subsidiary condition on state vectors 
which assures the positive definiteness of (1-2) at least for the physically allowable states. 
(In case of the electromagnetic field the positive definiteness is, as well known, guarranteed 
by Virtue of the Lorentz condition. ) 

We postulate that the metric operator ¢ is invariant under any transformation con- 
sidered in physics. Then the operators which keep the pseudo-norms (1-2) invariant 
must satisfy 


USF 1 UE (1-3) 


where U* is the Hermitian conjugate of U. We call such an operator U a “ pseudo- 
unitary operator”, and it will play an important role in the following discussion on the 
transformation properties. 

Let us consider some pseudo-unitary transformation. If a ket vector |x) is transformed 
under this transformation by a pseudo-unitary operator U as 
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aU x (1- 4a) 


‘ 


we call it a “covariant ket” and mark it by a dot symbol attached below as |x),. The 


ket |x)"=¢|x), is then transformed as 
ix) > |x') = SU |x) =U Ix). (1- 4b) 


Each ket vector with such a transformation property is called a “‘ contravariant ket”? and 
denoted by a raised dot symbol. The operator V=<sUS~" satisfies 


= 


Vimeyr et (1-5) 


and, in general, any operator satisfying the relation (1-5) is to be called a ‘ pseudo- 


“e 


unitary operator of the second kind”’, while the operator U introduced above the “‘ pseudo- 
unitary operator of the first kind”’. 
As the metric operator ¢ changes |x), into |x)° and ¢~' changes |x)° into |x),, it 


will be convenient to denote them as °° and ¢,, respectively, i.e. 


Sry ex Phas, ii te. (1-6) 
Thus $°°(¢,,) is the operator which raises (lowers) the dot symbol, quite analogously to 
g’* (Guy) in the Minkowski space. 
The dot symbols can be attached also to the pseudo-unitary operators of the both 
kinds, U and ¢US—', as 


eae andes SS SoU <5. =U". (1-7) 
With these notations (1-4a) and (1-4b) are rewritten in an elegant way: 
[aa Ixy es cand) x) =U |x). (1-8) 
The bra vectors dual (in the ordinary sense) to |x), and |x)° are transformed as 
follows : 
Kx | > =a] (OL) * SUP, 
(1-9) 
x] | =x] (UL) * =" |UT 
where 


(aU ard, U(r (0), “(1 10) 


Any bra vectors transformed like <x| or “(x| are called “covariant”? or “ contravariant 


bras” respectively. Evidently, we have 
jC er Cd Ci Cleary) 
and 
Vitis ULF , Ut =F? Ute, (ini) 


We can immediately see that the only inner products which are invariant under the 


transformation considered are those of a covariant vector with a contravariant one: 


*Cxlyy, = Cxlyy = CaF" |). =" lS ly” (1-13) 
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Of course, there may exist vectors that are neither covariant nor contravariant, but they 


are irrelevant to the covariant description of the theory. 
For the sake of convenience, the space composed of those vectors which have the 


* . . > 
afore mentioned transformation properties is called the ‘“‘ pseudo-Hilbert space”. 


$2. Transformation properties of operators 


We now investigate the transformation properties of operators. There are four kinds 
of operators that are relevant to the covariant formulation of the theory. They are 


characterized by the following transformation properties, respectively : 


i) Covariant operators : 2,.32',.=U,2,.U™, 
ii) Contravariant operators : 2°» 2"=—U 2 Ut 
2-1) 
iii) Mixed operators of the first kind : Be a = 2S GS 


iv) Mixed operators of the second kind: 2°,» 92" =U", 2°,U™ 


The dot symbols again indicate the transformation property of the operator. The metric 
operators 5°" and £,, should be a contravariant and a covariant operators, respectively, in 
order that their dot symbols may be in conformity with the above definitions (2-1). 
These transformation characters and the pseudo-unitarity of the U-operators give rise to 
the numerical invariance of the <’s, which is consistent with the postulate in $1. We 
can easily show that the above four kinds of operators are mutually related to each other 
by raising or lowering the dot symbols by means of the <’s, i. e., 


2. =F. 2° =F OF | and so on. 


Eee aS F = fia dL, (2-2) 


Care must be taken in forming a product of operators. Since we shall deal only 
with the operators whose transformation properties are (2-1), and since other kinds of 
operators such as 2...., etc., are irrelevant to our formulation as will be easily seen, we 
must always form a product of operators in such a manner that it also belongs to one 
of the four classes (2:1). Thus the dots left non-contracted are always located in the 


both ends of the product and determines the transformation property of this product. 
For example, we have 


2. 2:."= 8)... = (8 OS * 


ef ef ** . . 2-3 
2,°° 2, Oy =O," 9, O, =. = (OG OS" eee, ‘Be 


Se 


The above procedure is henceforth referred to as the “rule of proper products”. This. 


tule is also applied to products involving vectors, and we are to deal only with such ex- 
pressions as follows : 


— = Ft 
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| 


2))|x)eme lays 
Qe eaerx 


(2|x>)., 
“(| 2), 


(pre 


Kes 
, : (2-4) 
Cx) = rly) 

"C420 |y) = "Cx |2.. yy = 


As is already mentioned, our proper products correspond to the contractions in the tensor 
calculus, but there is no counterpart of the direct products of tensors (the only exception 
is the diadic expressions such as |x)” .{y| in which the dot symbols should not be con- 
tracted), because quantities with many dots are irrelevant to the present theory as already 
emphasized in § 1. 

The convention of the rule of proper products allows us to omit the annoying dot 
symbols in formal calculations. 


The expectation value of an operator is assumed to be given by the invariant : 


(x|O."|x), "(x1 Q,, [xo ae (2-5) 


We define the adjoint conjugation of an operator through the complex conjugation of the 


expectation value, i.e., if 
(Cx]2.°|x).) *¥=" | (2) Tx), 
ese Nin ie (CORR SEs darete 


hold for an arbitrary vector x, (2,°)7, etc., are called the adjoints of 2,", etc., respec- 
tively. More explicitly the adjoints can be defined in terms of the Hermitian conjugation 


(2-6) 


as follows : 
C2) tan)? 
(0°) t= (.2")* 
(2,°) 7 = (2",)*=6,, ( 
(DPS (O, AEE" (DD *E., 


(2-7) 
(Oh) * Eee 


(2-7) shows that the present definition of “adjoint” is equivalent to the former one 
introduced in (1-10). It is easy to show that the adjoint conjugation conserves the 
transformation property of the operator. Hence we can write CONV tes OT, (2° )'= ar 
etc. From this reason we see that there can exist “self-adjoint”? operators which are 
identical with their own adjoints. Here it must be noticed that the adjoint of a product 
of operators is, in general, not identical with the product in the reversed order of the 
adjoints of the individual operators. From the definition (2:7) we have, for example, 


(2,,.2. ts ((2, UG eo Wie COO: Q,) = Gos Oe 4 
= (0...) * (25) * = Oyt Gt (se 27°F 2,7). 


Thus the transformation property is, of course, conserved. 
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The transformation property of an inverse operator, if exists, is, in general, different 


from that of the original one: 
(CO jAS 0) (ae alee es 
and (9°) =O”. (2-8) 


§ 3. Eigenvalue problem 


The eigenvalue problem of a self-adjoint operator Q is investigated. The eigenvalue 
equation to be used here is somewhat different from the ordinary one: 
2. |Wa).= Walz). 


(3-1) 


\e | sees = | Ke 
2, .|we) =Wz|We).— a> ..|\Vx) - 


The latter equation differs from the ordinary eigenvalue equation of an Hermitian operator 
Oi aN 
The equations dual to (3-1) are 


* 


. . es 
{wel 2, =W," (Wa\|, etc. 


From this and (3-1) we have 


*Lewa| 2." |wg), =a* “welws)>. =@3 Cwa|@s).5 
(Wa*— os) “(we|@3).=9, 
so that 
if “<w,|e,.).200, oa =a, (real eigenvalues), 
and if ,* eae, *{wa|@s>.=0 (orthogonality). (3-2) 


These results are the same as those given by S. Gupta and W. Heisenberg in their recent 
works.” 

We now assume that the eigenvectors of © forms a complete set in the Hilbert 
space (though not orthonormal in the ordinary sense). Then one can take suitable linear 
combinations of the |w)’s such that the new set contains no vectors with vanishing 


pseudo-norms. After normalization we get a complete “ orthonormal” set satisfying 
"(Walp ) = Ges = Ee Fans €,=— +1 or —l1. (3-3) 
The following equalities are immediate consequences of (3-3) and the completeness : 


1. =2'4.3|Wa). a “Cos! =2,.8|@)° ia Cars| ’ ial = Jes 


Bs (3-4) 


=o a6 |We)” te “Coal > Sis =F a5 \Wa). s <os| - 


$4, The electromagnetic field 


This section is devoted to formulating the covariant quantum electrodynamics by 
applying our method to the electromagnetic field, and to pointing out that the difficulty 
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of the Lorentz non-invariance? of Gupta’s old theory® is due to the ignorance of the 
transformation properties. 


The electromagnetic four-potential 4, (x) is assumed to satisfy the following com- 
mutation relation : 
ee CS A,(y) |=—i9y,D(x—y). (4-1) 
Then we can immediately see that A,*(x), the Hermitian conjugate of A,,(x), also 


satisfies the same commutation relation as (4-1) so that A,(x) and A,*(x) are mutually 


related by a similarity transformation operator 7, namely, 


Ay (2) Jada (x) 7) Od 03 (4-2) 
Using (4-2) and its Hermitian conjugate equation, we easily get 
[y"4*, A, (x) ]=0, 


owing to which 7 can be chosen to be Hermitian as can be easily seen. This procedure 

means to give up the usual Hermitian or anti-Hermitian representation of the operators 

A, (x), and allows us to formulate the whole scheme in a Lorentz-invariant way. 
Suppose that the 4,(x)’s are transformed by an operator U as 


A, (x) > A,!(x) =UA,(x)U™ (4-3) 
under some physical transformation such as the Lorentz transformations, the gauge trans- 
formations, etc. The operator U must satisfy 

Un U*y=c 
where c is a real number, if we postulate that (4-2) is also valid for the same 7 after 
the transformation (4-3). By suitably choosing the arbitrary c-number factor in U we 
have 
Usha 25 7p, (4-4) 


Here we shall consider only the case of the plus sign, which occurs, at least, for the 
proper Lorents transformations and the gauge transformations. 

Let us now adopt the operator 7 as the metric operator (1-1). Then, according 
to the equations (4-2), (4-3) and (4-4), U is a pseudo-unitary operator of the first 
kind (U,"), whereas 4,,(x) is a self-adjoint mixed operator of the first kind (4,(x),”). 


The Fourier expansion of A4,,(x),” is 
A, (x). =2% rapl (ie) Se ?-a,f (he) er"), Wo | 
and the commutation relation of a,(k),” and a,'(k),” is given by 
[a,(k)., t(D) |=Iuv net: , (4-5) 
or briefly 
(a,°, at J=el., e=+1 for v1, 2, 3; 


—1 for p=0. 


(4.5)! 
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We define the “number operator”? N,” by 
N, =eai, a, (4-6) 


which is apparently self-adjoint but is not non-negative definite. Therefore it — 
even complex eigenvalues as we have seen in the preceding section. However, if N, —_ 
be restricted to have non-negative integral eigenvalues, we can still interpret a,” and a? 
as the annihilation and the creation operators, respectively. 


< 7? IAN ° 1),4) 
Now, assuming the existence of the “vacuum state 0), with the property 


0),=0, *{o|0),=1, (4-7) 


we can construct a set of eigenvectors of N,” belonging to non-negative integral eigenvalues 


as follows: 


jn), == (n!) AP (ai)""|0)., “Cal = (2 Ol (@*) 


N,"|n),=n|n)., n=0, iy Z, os 


(4-8) 


This set is orthonormal in the sense 
*(n|m) =e" : (4-9) 


By virtue of the assumption a,"|0),=0 the space , spanned by the set {/nm),, n< co} 
is an invariant subspace for the operators a,” and a’.*. Namely, there does not occur 
any transition between states belonging to §, and to its complement. Therefore, we can 
confine ourselves to this subspace, assuming that all the physical states belong to §,. 


The alternative way for this confinement is to adopt the projection operator into §, : 


C= ao lm), €" Cn]. (4-10) 


It will be easily seen that we can formulate the quantum electrodynamics in a 
Lorentz-covariant way by following the line of reasoning of Gupta’s old theory. For 
instance, the number of transversal photons can take only values of non-negative integers 
owing to the assumption a,"|0),=0, that is, no difference occurs concerning the transversal 
components compared with Gupta’s theory. 

Before concluding this section, let us investigate the relationship between the present 
method and Gupta’s old theory. The latter postulates that the metric operator should 
satisfy the following commutation relations : 


[4;(x).°, 7° ]=0, for j=1, 2, 3, 
Aco ie 7°]. =0, 


which make A,(x) Hermitian and A, (x) anti-Hermitian, as seen from (4-2). As the 
commutation relations (4-11) violate the “rule of proper products” mentioned in § 2, 
they are not U-invariant. The same is true for the Hermitian character of A, (x). 


Consequently, Gupta’s old theory is not U-invariant ; namely, it depends on a special 
Lorentz frame. 


(4-11) 


pi ONES 
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§5. Conclusion 


In the preceding sections we gave a method for treating the indefinite metric space. 
Our pseudo-Hilbert space with the inner product “(x|y), forms a so-called “indefinite 
vector space”’ recently presented by Gupta.) Though Gupta’s indefinite vector space is 
a mathematically larger entity, its mathematical characters are yet little known. There- 
fore, it will be inconvenient for theoretical investigations. Our pseudo-Hilbert space is, 
on the contrary, constructed on the Hilbert space which is a well-known concept in 
mathematics. And we can investigate, for example, the question of convergence with the 
help of the knowledge of the original Hilbert space. 

However, the validity of this advantage is restricted to the case where all the operators 
under consideration are bounded in the Hilbert space. Actually, this is not always the 
case. (For instance, the operator of the finite gauge transformation of the electromagnetic 
field does not satisfy this condition.) These difficulties are closely connected with those 
in the current field theory. In this paper we confined ourselves to removing the difficulty 
of Gupta’s old theory without wandering out of the Hilbert space. 

In passing, we make a remark on the formalism of the Dirac spinors. The con- 
ventional 7,-method with the Hermitian representation of the 7’s is evidently a wrong 
one, if the standpoint that the 7-matrices are to be transformed under the Lorentz trans- 
formations is equivalent to that in which the wave function ¢ is assumed to be changed 
instead of 7’s. A way out of this inconvenience is realized by a manner quite similar 


to that given in §5, by abandoning the Hermitian representation of the 7’s.” 
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A. new derivation of the two-nucleon potential proposed recently by the authors is given which 
avoids the ambiguous features of the scattering formalism employed previously. It is based on the 
reduction to a Schrédinger equation of a suitable covariant two-nucleon equation. An approximate 
evaluation of the fourth-order potential highlights the significant departure of the present proposals 


from past perturbation calculations, but represents a serious overestimate quantitatively of the additional 
contributions. 


S 1. Introduction 


In a preceding note” (referred to as A) we described in a qualitative manner what we 
consider to be the soundest method available for obtaining an approximate potential energy 
function for two nucleons, namely as the adiabatic and low energy limit of the kernel 

* V(r, r'; W) of eq. (15) of A. The purpose of this paper is twofold. We wish to 
redevelop the mathematical foundations required for actually deriving the general form of 
V and to obtain its local interaction limit, in order to justify our previous assertions 
‘concerning the latter. 

The sections which follow are not arranged in strictly logical order. Section 2 con- 
tains a descriptive account of method with the results carried only to the point achieved 
previously,” whereas Section 3 first contains the definition and derivation of all the 
required ingredients under the assumption that the meson-meson interaction can be neglected. 
Finally in Section 4, we proceed to a relatively simple albeit approximate evaluation of 


the potential suggested in Section 2, the purpose here being merely to exhibit the order 
of magnitude of the new terms.** 


* Supported in part by the U. S. Atomic Energy Commission and by the U. S. Air Force through 
the Air Research and Development Command. 


A more accurate evaluation and qualitative comparison with experiment has already been carried out 


by Konuma, Miyazawa, and Otsuki, Prog. Theor. Phys. 19 (1958), 17. Our own evaluation and quantitative 
‘comparison with experiment is in preparation. 


+ 


we 
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§ 2. Description of results 


Pursuing the method of A, we start with eq. (1) of that note 
(GPG) W(x) = | dt’ dtm! Tn, 5 8, Hf) POH). (1) 


We immediately specialize I to the case where it describes at most the exchange of two 


mesons, including all self-interactions but omitting non-linear effects of the meson field. 
We then have 


Il=2,-- 4, (2) 
where 


e 


1, (x,% 3 xy ty) = — ig? | dS dS PO Gane AG asl OGeass). GB) 
1, (x,% aa = (— ig?) */ai\dté,--dty A ® (x, x4!3 €&) 

SAE, ) Ate ee so Se) 

= (aig)? | ats d8y (x,|P(G,) GT (E) [xy 

Ae eo ee GI (53) (Xe) (4) 


Here J’ is the vertex operator for nucleon i, 4 is the Feynman propagator for the 
physical meson, and J® is the space-time amplitude for the scattering of a meson by a 
physical nucleon. Equations (3) and (4) are represented diagramatically in Fig. 1, double 
lines representing the propagation of physical particles. The term subtracted in (4), as 
indicated in the diagram is the iteration of I, and therefore constitutes a reducible 
diagram, four-dimensionally speaking. As also indicated in the figure, there then remains 
crossed diagrams with “single” nucleon propagation in intermediate states plus the sum 
of all diagrams which cannot be represented so simply. This statement distinguishes the 
four-point interactions represented by an oval in the diagram from those which are accorded 
a rectangular representation. 

To reduce eq. (1) as actually given to the form of eq. (15) of A would indeed 
require detailed knowledge of the forms of the functions which occur in eqs. (3) and 
(4). Suitable forms for this purpose would be the parametric representations which have 
been so extensively discussed of late.” We shall not pursue the general program here. 
Instead we shall proceed as expeditiously as possible to the static limit of the resulting 
interaction. Under these circumstances it is sufficient to replace the single nucleon and 
meson propagators immediately by the corresponding free propagators with renormalized 
mass values” and to consider from the beginning that the other quantities occurring have 
been renormalized.’ To order g’ then, the form of eq. (15) of 4 (in momentum space) 
can be obtained correctly by carrying out the k, and p, integrals in the following simple 


generalization of an equation essentially given by one of the authors,” 


878 A. Klein and B. H. McCormick 


[W—2E(p) |¢(p) =A.” (p) A. (—Pp) (—A) (271) a d*kdk, dp, 


-1 


x {fav +e |'+[av-r-E@ | | 
“(RA ()- Te (p, p—OL ts —p+k) 
x {AW tek -E(p—&) [+ [3 —ath—-Ep—&) | | o(P—A). (5) 


This differs from equations discussed extensively in Section 3 of reference 6) only in 
that the full vertex operators /’ occur in place of bare vertices. The contribution to 
the integrations over k, and p, from the singularities of these functions will again be 
small corrections to the terms arising from the poles exhibited explicitly.* Indeed as a 
partial adiabatic limit, it is correct to equate the relative energy dependence of these 
operators to zero. The resulting kernel will then exhibit the same structure as the second 
order perturbation theory result, in the notation of 4, 


Ge pak eS 1 (p, p—k)l'?(—p, —ptk) (6) 


w(k)|W—E(p) —E(p—k) —o(k) } 


as asserted previously by the authors.” With slightly altered notation, this is eq. (37) 
of ref. 2), which is so decisive in obtaining a suitable form of the fourth-order potential. 


For the static vertex operators which occur in (6), we are to write, for example, 
P®(p, p—k) =io™ -kt™ p(k) /2M, (7) 


where (k) can be identified with the source function of the fixed source gradient coupl- 
ing theory. 

In order to complete the derivation by extracting the fourth order kernel x“, we 
again follow the method of ref. 6), which informs us to add to the right-hand side of 
eq. (5) an expression of the form 


A, (p) A, (=p) (—#) (2ni)-*| dh, d had, dla dpy (by + £2)" (be + 22) 
x {[2 W+p—E(p) | +[3 W—p—E(p) | {20-70% 
X(p, —k,|J®|p—k,—ky, ky) (—p, k,|7°|—p+k, +k, —ky) 
—P°(p, pak) P2(—p, —p+k) |W + phe Eph) | 
x2 pth E(p—k) | 


XE (Pky, Pky ky) EO (—ptky, —ptk, +k) } x 


* : 
These corrections as well as those to the meson propagator will give rise to potentials of range 1/3 
and less. See ref. 5) for some of these arguments. 


“= *\* T 
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—1 
x {| 37 —pothar+ha—E(p— hi hy) | 


+| 307 +P)—ky—ky—E(p— hha) | bo p—ky— he). (8) 


This contains only the Fourier transform of the first (reducible) term of eq. (4) from 
which is subtracted the iterate of uv, eq. (6). The appropriate normalization for the 
operators J can be inferred from the Born approximation upon comparison with the 
cortesponding equation of ref. 6) and in any case will be given below. 

With the statement of eq. (8), the first object of this paper has essentially been 
achieved. For if we substitute in it for the 2/® the Born approximations thereto,* to 
be called 7, the resulting kernel yields upon static approximation the fourth order 
contribution to the BW potential, augmented only by the finite core size effect contained in 
eq. (7). The potential resulting thus far can be found summarized in eqs. (47) and 
(48) of ref. 2). 

Let us now suppose that the perturbative contributions to eq. (8) just discussed 
have been removed. What remains has a well-defined static limit. (The iterated term 
of eq. (8) is now gone.) It is then safe to take the adiabatic limit prior to the per- 
formance of the relative energy integrations by remarking that in this limit, for example, 


[4W +p.—E(p) +ie]*+[4W —p,—E(p) tie] > — 2710 (py); (9) 


where we have remembered, for the first time explicitly, that all contours are to be taken 
in the Feynman sense. By means of (9) and a corresponding expression for the last 
curly bracket of eq. (8), we find that under the further neglect of nucleon momenta, we 


have for instance 
(p, Pos —k,, kil J |p—k,— ky, Prk — kes ka, oo) 
(0, M; —k,, k|S®|0, M; ky, ky) =(—hy| I” (hb) 


ky), (10) 


where the latter is the scattering amplitude for a virtual meson with indicated energy 
and momentum by a fixed nucleon. This procedure eventually results in an additional 


contribution to the potential describing the exchange of two mesons of form** 


Ov,(r) =41 (27) “| deh, d*k, dk, (ky? — wr +ie) (ky — ws’ +i€) ey 
x<exp| i(k, +h) r| (—k,| 7 (— hy) | — eg) (lea| 7 (ho) |e) 


5 (2h 1G%(—b) |= Ie) (|? (&) hy) | (11) 


* In the following sense: Free particle propagators, corrected vertex according to eq. (7) and positive 


energy nucleon intermediate states only (i. e. only P-wave interaction). . 
** Tt should also be noted that by way of redefinition, the coupling constants have been incorporated 


into the scattering amplitudes in contrast to the previous situation. 
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Equation (11) is equivalent to eq. (51) of ref. 2). The further evaluation can proceed 
either by the methods described there by the authors. suitably augmented to include S- 
wave pion-nucleon scattering or, more conveniently, by utilization of the dispersion relations 
available from fixed source meson theory for the scattering of S and P-wave mesons.’ 
The contribution of higher angular momentum states to the static limit of the two-nucleon 


interaction should be negligible. 
To fix the normalization, we shall record the formulae which are to be substituted 


into eq. (11) for its further evaluation. Here we adopt the notation of Miyazawa,” the 


equations which follow being essentially those given by him. We write 
os (ky) Sh Ai, (12) 


where the S-waves are represented by the expression 


T oS (Ry) =D (hy) 25 T+ E (hg) M755 (13) 
Dk, = T (a, = 24;) —32 (k, ») (a, —d,) > 
E(k) =27 (a,+2a;) +3 (ky/ 4) (a,—4,)- (14) 


We have omitted from D and E integrals over the S-wave cross-sections whose contribution 


can be neglected beside the terms shown. For the P-waves we write 


(k,| JI (ky) 9) = A(k,) T;T,0° k,o-k, 


+ Bk) (75 7;,0 +h, O- hy + 7,7;0-k, O-ky) +C (ky) 7, 7,0-k, 0- ky, (15) 
where, including only the Born approximation and contributions from the resonant 33 
state, 
A(ky) =( f iA a = ; : e ep Oo. ( v0 | . -s : | 16 
U7 k—te TAR ey basisall « w(p)—k,—ie wp) +k, —ie + 18) 
Lot 1 . 
Bb) =| Polo )| Sees ra 
: 127 J w(p) is cotye _w(p) —k,—te w(p) +k, —ie » fe a 
C(k) =—( id } 2 a : dp — Ogg Co») : —— > |. 
LE ky +i€ 367 \ w(p) . w(p) —k,—ie w(p) +k,—ie 


(18) 


The application of these expressions to eq. (11) and the comparison of the resulting 
potential with experiment will be presented elsewhere. 


§ 3. Structure of the interaction in the linear meson approximation 


We turn to the derivation of the results described at the start of the previous section. 
ist ()(x) be the field operator for the nucleon field and d(¢) that for the meson field 
spinor and isotopic indices being suppressed as previously. Let the fundamental field 


Derivation of the Two Nucleon Potential 881 


equation satisfied by ¢/(x) be* 


[yp +M—F (d) ]=0, (19) 


where <# (d) is some given function of 6. We shall further require the presence of an 
external source J($) for 6(¢) as described by a coupling term 


L' (x) =J (x) d(x) (20) 


in the Lagrangian density of the system. Let @ be any operator. Then under the 
assumption that J(x) vanishes both in the remote past and future, and that vacuum states 
|0, (+00) can therefore be defined at these epochs, we can define 


(9) =(0(+ 0) |6|0(— 20) >/K0(+ 20) |0(— 0) 9, (21) 
such that 
(9) |s=-0= (0|4|0 (22) 
is the usual vacuum expectation value. 


The basic formula of the succeeding development is then the statement” (T is 
Wick’s time-ordering symbol) 


(T (6) 9) > =[bE) > — 1070] (F) (9), (23) 
which, together with the definition 
0 O(F))/O] (§") |z-0=1(T (FE) 6) ) =4AEE, &), (24) 


will find repeated application in what follows. Thus, by means of eqs. (19) and (23), 
we find that the one nucleon Green’s function, 


G(x, y=KT(P@™)$Q))), (25) 
satisfies the functional-differential equation 
{rp+M—A[G) —i0/8J}G=1. (26) 
Similarly the two nucleon Green’s function 
Gio (Xs X25 Yur Yo) =PKT ($1) $(%) P (ye) PO)? (27) 
satisfies the equation 
{rp t+M—A[ by —8/0J]} p+ M—A[G)—i0/9J}C2=1, (28) 
where here, for example, 1 is the antisymmetric unit matrix, 
x, %|1]%15 Yo =O (%1— 91) 8 (%— Yo) — 8 (H1— Yo) O(%2—91)- (29) 
Ignoring antisymmetrization in what follows, we write for the formal solution of (28) 


G..=G [g—10/0] |G [d—i0/0] || =o. (30) 


* The arbitrariness of pu (¢) will emphasize the generality of the succeeding approach, 
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Noting that 0/dJ=40/04, it follows that (30) may be rewritten* in the form 
6,=G [6—id[6](0/0$) —id[g](8/02) 1G®(1—id[1] 8/00] |p-x-0 
=exp{—id[4]0°/dd07} G |d—id[d+id|9 0/0%\ 0/00] 
x G [%—id [1] 0/07] e-4=0- f (31) 
The last form of eq. (31) is not exceedingly useful in the general case. In the special 
instance in which we assume that 4 is independent of 4, i.e., 
0"4/(0¢)"=0, n>0, (32) 
which we may term the linear meson approximation, we obtain, however, 


Go=exp[—idd?/A4 076 [—idd/6]G® [7-14 3/07)}, 


= OO ata AE —4) --A(F,—F,'/) 


n=0 n ! 
x [3"G [41/84 E,) 6 E,) Teal "G2 121/02 E) OLE) Vx-0- (33) 


A more general analysis which includes non-linear effects will be carried out in a future 
publication. Here we shall exploit eq. (33), which is more than sufficient for the 
practical purposes of this paper. The various terms of that equation have a simple 
interpretation in that the term of order n represents a propagation with the exchange of 
n mesons, all nucleon self-interaction being included exactly, in principle. 

In our previous work,” the authors used eq. (33) to discuss the two nucleon potential 
via the intermediary of the S-matrix to which it gives rise. As pointed out there and 
more forcefully in the preceding note, such a procedure appears fraught with ambiguities. 
We therefore proceed, in the spirit of this paper, to derive the equation satisfied by G,, 
from the expression (33) which it satisfies. The method is, of course, time worn, the © 


only difference being in the expressions which actually occur here. We write 
Gyp=GPG?+4+G%G°T,.GPE®, (34) 
where [,,, a sum of both reducible and irreducible interactions, can be read directly from 


(33). We further suppose I, to have the form 


Ip=I+ 1G G®T4 = SU(G®GON*, (35) 


n= 


It follows that G,, satisfies the integral equation : 
Gy=G"G®+E6%GIG,,. (36) 


The situation has already been illustrated through fourth order in Fig. 1 and eq. (4). 
Here we need only to record the precise definitions 


P(x, y3 §) =0G"[x, ¥; 4]/d9d(F) |p-0, (37) 


* The considerations following immediately are discussed in Appendix A of Ref. 2), 
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Fig. 1. Feynman diagrams for second and fourth order irreducible 
interaction. The diagram involving ovals contains reducible terms, 
whereas that with rectangles contains irreducible interactions only. 
Double lines indicate the propagation of a physical particle. 


and 
T(x, ¥3 & 9) =(|G*[0°G/08 (F) 06.(y) 1G" |y) gn (38) 
for the vertex operator and pion-nucleon scattering operator respectively. The results of 
eq. (4) can easily be extended to all orders in g. Moreover from eq. (36) for Gy, 
there follows eq. (1) for the amplitude Y(x,, x), 
P(x, %) =CO|T (P(x) $2) ) |), (39) 
where |) is an unspecified two nucleon state, the derivation proceeding by means of the 


limiting procedure of Gell-Mann and Low.’ 


§4. Approximate evaluation of non-perturbative corrections 
to fourth order potential* 


The importance of the additional potential dv,(r) exhibited in eq. (11) may be 


* The results of this section were first reported at the Washington meeting of the American Physical 
Society, April 1955. See B, H. McCormick and A. Klein, Phys. Rev, 99 (1955), 618, 
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demonstrated by means of an approach which represents a further simplification of the 
assumptions of Section 2. We proceed from the appropriate formula for the potential 


of order 2n,'” 


! 
Un (rt) 0G) =~iT-"| de dt, dt, dt,’ dR’ 


/ 


X exp [iM(4,+4—4/— ty’) | Tren (%, » X25 %1> x,') | adiabatic (40) 


> 


which summarizes the experience of this paper. Here T is a “large” time interval 
which will cancel out in the final result. The prime on the integral signifies that for a 
reducible part of Ip, eq. (34), the region of four-fold time integration is restricted to 
exclude times for which there are only two nucleons and no exchange mesons in the 
intermediate state. 

To specialize this result to the calculation of the fourth order potential, we shall 


utilize Ip, in the form (see eq. (33)) 
Tris (%1» X23 Kies) =31(—1)'| d*=,---d*7y, 
X (x,|G7 (0? G/0ba: (F1) Was (F2)) G~ x,') 4(F,— 41) 4 (=,— he) 
X (x2|G™" (0 G/8b.a1 (41) Oa (Ge) ) G~"|22". (41) 


To evaluate the limit of (41) for fixed nucleons, we require the appropriate second order 
functional dependence of the single nucleon Green’s function G|dé| upon an external mesic 
field gd. A consideration of pion-nucleon scattering by one of the authors,” has shown 


that low energy theorems imply the following approximate non-relativistic form for G~'|¢}, 
G-"[6]=G~"[0]— (9/2M) [d*21(8) oetaredi(8) 
+ | ateats71) 1) {p,(/2M) 6) 6) 
+ 09(9/2M)? €; 5 7,4; (¢) (0,.(€"), 06.) 


—A[9/(2M)*) p96: (€) radi (F) 
+,| g°/ (2M) = Cary Cin Ta 71790, (§) r+ (&") } > (42) 
where 
<x] 1 (€) |x’) = 04 (x— &) 04(x— 2’). (43) 


Equation (42) depends on five parameters : ( 9/2M) = (f/ 12), the P-wave coupling constant, 
(f?/47) = .08 two parameters , and 9, which measure the strength of the S-wave and 
a pair of parameters /, and /, which measure the deviation of the P-wave from its Born 
approximation value. A consequence of eq. (42) is that the S-wave phase shifts, as a 
function of meson energy «, are given by expressions of the form 


a; (w) =— (9/47) (u/M) 7 [ +4(w/M)p,}, 
a, (wv) = — (9/47) (4/M) 7 [| 1— (@/M) po). (44) 
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This tepresentation is fully equivalent to that suggested in eqs. (12)—(14) and indeed 
the present approach treats S-waves with sufficient accuracy for all current needs. Using 
the experimental ‘values’? a,/7=0.167+0.012, a;/7=—0.105 40.010, we find from 
(44) (with w=) that p,=.01, oe, .57. 


Similarly the P-wave phase shifts may be written as 
a;(w) = (4/3) (f?/47) 7° (4/) [1 se mA (w/M) (A, ci hy) ] > 
As, =A, = (— 2/3) (f?/47) 7° (4/o) [1 a (w/M) (4, — 245) ] ? 
ay, = — (8/3) (2/42) 75(4/e) [1-3 (w/M) (2, +42) J. (45) 


The representation (45) is equivalent near threshold to the effective range approximation.’ 
With w=y and using the accepted values” a,,=0.22 7°, a;,=—0.047 7°, we obtain 
from the first two of eqs. (45) 4,=2.9 (M/p), 4,=1.3 M/pv. With these values of 
the parameters, we find a value of @,, which is quite close to zero at low energies. 


60 


Singlet even 


- $v 
10 ‘ 


0.01 


0.0015 1.0 
X= Mr 
Fig. 2. Central force in singlet even states. Shown separately are 
the second order potential (V2), fourth order perturbation result 
(V3), P-wave scattering corrections (Vp) and S—P interference 
terms (Vs_p). The total potential which is the algebraic sum 
of these contributions is not shown. 
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We now present the fourth order nuclear forces obtained from the combined use of 
(42), (44), and (45), omitting all details of calculation. We first display the P-wave 
contribution, the origin of the various terms being evident from the coefficients involved, 


and the notation standard : 


wnt =r /4n'nfe-2(—— | a(t 2+ ROO +) 


seon( 4 )[mcen( 2 +2 )enc 2 | 
Se (=) Sites) (qe erce ne Rate 
— (fe/4nytp 2 (2) { aaa 2%) [ GEEE Ke 


Z 


+ (2+ 2x-+2") Ky (x) elt oe eekte tee a Ds 


Triplet even ( central) 


0.01 


0.0015 
0.5 1.0 1.5 

Xeur 
Fig. 3. Central force in triplet even states, 


alt Contributions have 
the same relative significance as in Fig. 2. 
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dye K, (x) + (142) K(x) e| 


—Ls,=220-20)| CPt Kay me toK@ | 


—342(f*/42)*(4/2M)? (2) 
7Le 


43 63 1 43 San} 
K,(2 ( de Sede hoe 
AOS ae herrea + )+ Ko (24)( art )t 


+22 (f3/42)*("/2M)? (2) 2-2 2) 
T 
4 (2) 10 7 : 10 9) 
<= oO G2) K US ee 
; - ot =o, 1(2%)( i a a )+K,(2x)( e ate 3 )| 


2 $.4[ K,20( 3-449) 4-K20( 23-44 )] 


50 
Triplet even (tensor) 


+8V, 


0.01 


0.001 
fe) 0.5 1.0 1.5 
X=pr 
Fig. 4. Tensor force in triplet even states 
Fig. 3, except that there is no S—P contribution 


Notation same as in 
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Baws en ae 6 2. ae A hs 
—6/,( r) (4/2M) ple /*)| x! tee - | 


Xa 0 x 
’ Z 
lee aay Ce ne Ses ier ee: oa 
+ As / AAA 2D) fe) 8 . G2 oO. (e 4 /x) 2 + 3 i 2 + . 
ere |B 12 Be. 2 
— 90.2 5,(6%/2)| 4 +2}. (46) 
3 x to x x 


For the S—P interference contribution, we find 
Vas—p (r) =6 (f?/47) (9° / 47) ft (p, 2M) pL ( LS x") . Ed x*) 


+ (f/47) pan” -2)(2)| K, (209( se : z )+K(2s) a 


reeyine( 2 [ben 24 L)+Ra9 2] oo 


We notice that 4, does not occur in these terms. 


Finally, for the pure S-wave contribution, we obtain 


vas 0) = —3 (92/48)? (0/2M)* Hp 3( 2) K 2x) /* 


a (g?/4n)? (4/2M)* np3(2-)®-28 Ee — a0" |. (48) 


Of these results the first term of eq. (48) was derived some time ago™ by means 
of a special form of the method of this section and used to prove that the S-wave con- 
tribution was indeed negligible compared with that of the P-wave. This continues to be 
true even after the addition of the second isotopic spin term. Because of the exceedingly 
small value of ~, and small value of », the S—P interference potential is also small 
compared with the pure P-terms. To illustrate this conclusion and to exhibit the relative 
importance of the P-terms, especially the “scattering corrections”, we have plotted the 
central force in singlet even states in Fig. 2, exhibiting the various contributions, second 
order, fourth order perturbation, P-waves and S—P wave “ scattering correction ”” separately. 
The corresponding situation for the central and tensor force in triplet even states is ex- 
hibited in Figs. 3 and 4 respectively. A detailed analysis of the potential in Sec. 2 
shows that the effective range approximation overestimates the actual P-wave contribution 
by at least a factor of two and in some cases by close to an order of magnitude. Never- 
theless, Fig. 2 establishes unequivocally the importance of the additional P-wave contribu- 
tions to the central force. On the other hand Fig. 4 illustrates that the second order 
tensor force is still of paramount importance. 


9) 
10) 
11) 
12) 
13) 
14) 
15) 


Derivation of the Two Nucleon Potential 889 


References 


A. Klein Prog. Theor. Phys. 20 (1958), 257. 
A. Klein and B. H. McCormick, Phys. Rev. 104 (1956), 1747. 
H. Miyazawa, Phys. Rev. 104 (1956), 1741. 


Proceedings Seventh Annual Rochester Conference, Sec. IV, Interscience Publishers, New York, 
(1957). 


- Hiida, Iwadare, and Machida, Prog. Theor. Phys. 15 (1956), 189. 


A. Klein, Phys. Rev. 90 (1953), 1101. 

R. Oehme, Phys. Rev. 102 (1956), 1174. 

A. Klein, Phys. Rev. 104 (1956), 1131. 

J. Schwinger, Proc. Natl. Acad. Sci. U.S. 37 (1951), 452. 

M. Gell-Mann and F. E. Low, Phys. Rev. 84 (1951), 350. 

A. Klein, Phys. Rev. 95 (1954), 1061. 

A. Klein, Phys. Rev. 99 (1955), 998. 

J. Orear, Nuovo Cimento 4 (1956), 856. 

G. I. Chew and F. E. Low, Phys. Rev. 101 (1956), 1570. 

S. Barnes, Proceedings Seventh Annual Rochester Conference, Sec. II. 


890 


Progress of Theoretical Physics, Vol. 20, No. 6, December 1958 


Magnetic Properties of the Iron-Group Anhydrous Chlorides 


Junjiro KAN AMORI 


Department of Physics, Osaka University, Osaka 
(Received August 27, 1958) 


The magnetic properties of FeCl, are investigated in detail from the standpoint of the one-ion 
approximation. The lowest orbital state of the ferrous ion in the crystalline field of trigonal symmetry 
is assumed to be doublet, in which the component of the orbital angular momentum along the trigonal 
axis is not quenched. The spin-orbit coupling energy produces a splitting of the spin levels belonging 
to the lowest doublet. In the vicinity of, or below, the Néel temperature (24°K) where only the 
lowest spin level is populated, we have effectively an Ising model, in which the x, y components of 
the spins are completely quenched. The metamagnetic behaviour of this substance at liquid hydrogen 
temperatures can quite naturally be interpreted by this model. Also the susceptibility near the room 
temperature is discussed and it is shown that the paramagnetic Curie temperature is strongly modified 
by the large splitting of the spin levels due to the spin-orbit coupling energy. The crystalline field 
is calculated, and the result seems to support the assumed orbital level scheme. FeCO;, CoCl., NiCls, 
FeBro and Fels are also discussed briefly. 


§ 1. Introduction 


The anhydrous chlorides of the iron-group elements are sometimes grouped as meta- 
magnetic substances, because their susceptibility is more or less field-dependent at low 
temperatures.” Especially, FeCl, is known to have a strongly field-dependent susceptibility 
at liquid hydrogen temperature.” The purpose of this paper is to make a theoretical 
investigation of FeCl,, based on one-ion approximation which has hitherto been applied 
to several antiferromagnetic ionic crystals’? with considerable success, and further to 
discuss the magnetic properties of FeBr,, FeI,, CoCl, and NiCl, from the same point of 
view. 

In 1933, Landau” suggested that the field-dependence of the susceptibility of these 
chlorides might be related to the transition from antiferromagnetic to ferromagnetic state 
at a certain external field. All these three chlorides have the crystal structure of the 
CdCl, type,” which, if one neglects a small deformation, can be derived from the lattice 
of the NaCl type by taking away alternate hexagonal close-packed layers of cations. Landau 
assumed that ferromagnetic interactions align the magnetic moments parallel within each 
hexagonal layer, and that, in the absence of an external field a comparatively weak anti- 
ferromagnetic interaction makes the moments of adjacent layers antiparallel. This picture 
has recently been confirmed by neutron diffraction experiments,” which also have shown 
that the moments are oriented along the trigonal axis in FeCl, but they are perpendicular 
to this axis in CoCl,. 


According to magnetic measurements on powder specimens by Starr, Bitter and 
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Kaufmann,” the susceptibilities of FeCl,, CoCl, and NiCl, under low field show a maximum, 
which is characteristic of antiferromagnetism, at 24°K, 25°K and 50°K respectively, where 
also specific heat anomalies are observed.» The magnetization curve of FeCl, below 24°K 
is characterized by a small initial susceptibility which decreases rapidly with decreasing 
temperature, followed by a very large differential susceptibility and eventual approach to 
magnetic saturation. The saturation of magnetization at high field suggests the occurrence 
of ferromagnetism. On the other hand, in CoCl, and NiCl, the susceptibility shows only 
a small dependence both on temperature and on field strength below the Néel temperature. 
Recently, Bizette et al.” have made magnetic measurements on single crystals of these 
substances, whose results will be referred to in later sections (§ 4 and § 7). 

Neel’ has given a phenomenological explanation of these properties. He ascribed 
the behaviour of FeCl, to the transition from antiferromagnetic to ferromagnetic state, 
whereas for the cases of CoCl, and NiCl, he assumed that the magnetic moments flip 
from the easy axis to the direction perpendicular to the applied field in the layers. From 
energy consideration he concluded that in FeCl, a strong anisotropy energy binding the 
moments to the easy axis must be present. 

In this paper we shall show, by investigating the microscopic origin of the anisotropy 
energy, that the experimental data on FeCl, both in antiferromagnetic and paramagnetic 
states can be reasonably accounted for. We show also that the difference between the 
magnetic properties of FeCl, and those of CoCl, can be explained by such an investigation. 
Our main idea is as follows :— 

Since the chlorine ions form approximately a face-centred cubic lattice, the main part 
of the crystalline field around a cation is of cubic symmetry. In addition to it, there is 
a ctystalline field of trigonal symmetry which arises from the non-cubic arrangement of 
cations, the induced electric dipole moments of the chlorine ions and the small departure 
from cubic symmetry in the chlorine ion arrangement. In the cubic field, the ground 
orbital state of a free ferrous ion, °D, splits into two sublevels, of which the lower one, 
Ll’ is triply degenerate.” The trigonal field splits this sublevel into a doublet and a 
singlet (see Fig. 1). The main origin of the magnetic anisotropy energy is considered 
to be the spin-orbit coupling modified by the crystalline field.” We assume that the 
lowest orbital state is the doublet, because the anisotropy energy favouring the trigonal 
axis cannot be obtained with the singlet as the lowest state. (This is reversed in CoCl,). 
Further we assume that the doublet is well separated in energy from the singlet as com- 
pared with the spin-orbit coupling energy.* 

With these assumptions, we can show that a large splitting is produced by the 
spin-orbit coupling energy among the spin levels belonging to the orbital doublet, since 
the component of the orbital angular momentum along the trigonal axis is not completely 
quenched. The lowest spin level is doublet, in which the z-component of the spin takes 


* As will be discussed in later sections, we cannot judge from the experimental results obtained hitherto 
whether this assumption is valid or not. However, the calculations in the following sections will serve at 
least to the qualitative understanding, even when future experiments invalididates this assumption. The 


calculations without this assumption will be published in another paper. 
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values +2, while the x and y components are com- 
pletely quenched. (Here we take the trigonal axis as 
the z-axis.) In the vicinity of, and below the Neel 
temperature, where only the lowest spin doublet is 
populated, the Ising model is therefore realized for 
the spin system of FeCl,, the magnetic moments 
being tightly bound to the trigonal axis. In the 
following sections, we discuss the behaviour of the 
principal susceptibilities both in antiferromagnetic and 
paramagnetic states and the magnetization curve in 


antiferromagnetic state. The theory can well explain 


the data of the powder specimen of FeCl,, though 
there remains some ambiguous points in the quantita- Fig. 1. Orbital level schone'of Butt, 
tive comparison with experiments. Also the theory 

can explain qualitatively the behaviour of the principal susceptibilities of FeCO, at room 
temperature. 

An attempt was made to calculate the trigonal field, in order to justify the orbital 
level scheme assumed above. The result, however, was found to depend sensitively on the 
magnitude of the effective point dipole moments induced on chlorine ions. If we assume 
the magnitude of it calculated with the use of the polarizability and the magnitude of 
the electric field acting on each chlorine ion, the energy of the orbital singlet is found 
to be lower than that of the doublet. With a. magnitude of the induced dipole moment 
less than about two-third of the calculated value, the assumed orbital level of Fe®* is 
obtained. 

§ 2—§ 6 will deal with FeCl,. In § 2, we derive the effective Hamiltonian of the 
spin-orbit coupling energy and the Zeeman energy for the ground orbital doublet. Using 
the results of §2, we discuss the antiferromagnetic state and the transition from the 
antiferromagnetic to ferromagnetic state in § 3 and § 4; the paramagnetic state is discussed 
in §5 and §6. In the final section we discuss the properties of CoCl,, NiCl,, FeBr, 


and Fel,. Appendices supplement the calculation of the text and give the calculation of 
the crystalline field. 


§ 2. Effective Hamiltonian for the lowest orbital doublet 
of the ferrous ion 


With the assumptions mentioned in § 1, we shall derive the effective Hamiltonian 
of the spin-orbit coupling energy and Zeeman energy for the orbital doublet of the ferrous 
ion by the perturbation method, in a similar way to Pryce’s™ procedure in non-degenerate 
case. 

We take the trigonal axis as the axis of quantization and write ©, the orbital wave 
function specified by the azimuthal quantum number M of the orbital angular momentum. 
The orbital wave functions of the lowest doublet can be expressed as 
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f,=cosU g,+sind g_,, 

‘ahs | (1) 

hy .=cosl Y_»—sind 9¢,. 
The angle &% depends on the ratio between the magnitude of the cubic field and that of 
the trigonal field. If we neglect the effect of the trigonal field, % is tan~'1 I odes The 
_wave funtion of the singlet is given by 


$,=Qy. (2) 


The wave functions of the highest orbital doublet are given by 


hy, = sin? Y,—cos0 o_,, 
aa @) 
(no =sin ¥ O_.+ cost ¢,. 
With these , we can show that in the ground doublet L, is diagonal and (g1|L.|91) 
=— (g2|L.|g2), and other components of L have no matrix elements. The details are 
given in App. I. 
It is convenient to introduce the Pauli o.-matrix which operates in the two-dimen- 
sional space spanned by ¢/,, and ¢/,.. The first order spin-orbit coupling energy in the 
ground doublet is expressed as 


ACL a, S, (4) 
where (L)=(g1|L.|g1). <L) is equal to one, if we assume o=tan7" Lyd, (see 
Appendix 1). The first order term of the Zeeman energy is given by 

pty (254+ (L) 0.) Het 2ftp (HeSeHyS;). (5) 


The first order energy of the spin-orbit coupling, (4), splits the ground orbital 
doublet into five doublets, specified by 


o,=1;'S,=m ‘and o,=—1, S.=—m (m=2, o-« ri 2 


They have a constant spacing |A\(L), and the lowest doublet corresponds to o,=1, Se 
and ¢,=—1, §,=—2. 

The non-diagonal matrix elements of LE connecting the lowest orbital doublet to 
higher states produce the second order perturbation energies of the spin-orbit and. orbital 
Zeeman energies, which contain non-diagonal as well as diagonal terms with respect to 
the spin doublets produced by the first order energy, (4). Since the non-diagonal terms 
are not important in the following discussions because they are small, we pick up here 


only the diagonal terms. They are given by 
H, = — (32 sin? 3/2 4E,) (S2+5,?+0.5.) 
— (#/AE,) {(9/4) sit? 20 $2+824+57—0.5,}, (6) 
H,= (3/,|A| sin’ /4E,) (H,S,+ Hy, Sy) + (42|4|/4E,) 
x {(9/2) sin? 20 H,S,+2(H,S,+ H,5y)}; (7) 
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and 


H,=— (3p,’ sin’3/2 4E,) (H?+H,’) — (#2°/4E,) 
X {(9/4) sin’ 20 H+ AS +, (8) 


where JE, is the energy separation between the lowest orbital doublet and the orbital 
singlet and JE, the separation between the two orbital doublets. H, is the second order 
energy of the spin-orbit coupling, H, the cross term of the orbital Zeeman energy and 
spin-orbit coupling energy, and H, the second order term of the orbital Zeeman energy. 
H, gives the (y—2) part of the Zeeman energy, while H, gives rise to the temperature 
independent susceptibility. 


§ 3. Theory of the antiferromagnetic state 


If we assume (L)=1 and the free ion value of 7, —100 cm’, the spacing of the 
doublets will be |A4|(L)=100 cm, or 144°K. If JE, is larger than 1000cm™, the 
second order term, H,, affects this separation only by an amount less than 10 per cent. 
Therefore, in the vicinity of the Néel temperature, 24°K, or below it, almost only the 
lowest doublet is populated. In this lowest doublet, S, and 5S, have no matrix element, 
and S, and o, can be replaced by the effective spin operator s, of magnitude 1/2 through 


the relations, 


S=4csccand anise. (9) 


Thus we retain only the diagonal part of the Hamiltonian with respect to the lowest 
spin doublet, and such a truncated Hamiltonian is essentially the Hamiltonian of the 
Ising model. 

The interaction between the magnetic moments consists of the exchange interaction 
and the magnetic dipole-dipole interaction. The latter is small, but not negligible compared 
with the former. For the exchange interaction we take the Hamiltonian 


H,.= —2J, >) S;-S;+2), >) S,-S,, (10) 
<#,Jj> <k i> 


which reduces for the lowest doublet to 
Her — 32 J; 31 See” 5y.+ 32 Jo S) See * She 
<g> <k> 
The first sum is the ferromagnetic interaction within the same hexagonal layer, the second 
sum represents the antiferromagnetic inter-layer interactions, and ¢ , ) means a pair. In 
the following, we shall treat these interactions in the molecular field approximation. 

The effective field arising from the magnetic dipole-dipole interaction can be divided 
into three parts: the first is the field arising from the magnetic moments within the 
Lorentz sphere, the second the Lorentz field, and the third the demagnetizing field. The 
total effective dipole field acting on each ion belonging to the-+sublattice is therefore 


H,' =0,M' +0,M> + (47/3) (M* +M-)—N(M*4M-), (11) 
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where /, and @, are the diagonal tensors obtained from the dipole sum within the Lorentz 
sphere, N is the tensor of the demagnetizing factor, and M* and M~ are the magneti- 
zations of the sublattices per c.c. The magnitudes of M, and @, are given in Appendix 
II. For the ions of the — sublattice 2, and @, should be interchanged. 

We discuss first the principal susceptibilities in low field. As discussed before, the 
x, y components of the applied field disappear in the Hamiltonian of the first approxi- 
mation. - However, the non-diagonal elements of 5, and S,, between the lowest doublet 
and the second doublet of S.=1, o.=1 and S.=—1, o,=—1 give rise to the second 
order perturbation energy, from which we obtain 


= 2NGE Be / |AlCe (12) 


Here we neglected the contribution arising from H, given by (8), as it turns out to be 
small. 

For the susceptibility along the trigonal axis we expect the normal behaviour of the 
antiferromagnetic parallel susceptibility. In the molecular field approximation, the effective 
fields to which the effective spins of the + sublattices are subject are given by 


Hi, =H+ ('+,,+47/3—N,) M*— (A—@,,—42/3+N.)M*, (13) 
where A and /’ are defined by 
['=64],z,/N,(49-+2¢(L))? and A=64J,z/N,(49.+2 (L))?. 


z, and z, are the numbers of the neighbours which interact with each ion and N, is the 
number of cations per c.c. 


The parallel susceptibility at the Néel temperature is given by 
Xv mor| r=r y= (N/N,) : (A—@,,—42/3+N,)™. (14) 


Below the Néel temperature we expect that the parallel susceptibility tends to zero with 
decreasing temperature as in ordinary antiferromagnetic substances. 

At a high field, a transition from antiferromagnetic to ferromagnetic state takes place. 
‘This transition was discussed previously by the present author, Motizuki and Yosida” and 
also by Gorter and van Peski-Tinbergen,’” both using the molecular field approximation. 
If the coefficients of the molecular field satisfy the relation, 


1> (4+9,,+42/3—N,) /(A+14+9,.—P,.) = 3/8, (GIGS) 
the first order transition takes place in the temperature range given by 
Od lps 3 0043) (A4+1'+9,,—%,,) /(04+9,,+42/3—N,). (16) 


At absolute zero, the transition takes place when the strength of the field applied along 


the trigonal axis reaches a value, 


H,.= (A—@,,— 47/34 N,)M, (17) 
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where M=|M#|. The transition field decreases with increasing temperature.* 
If the applied field makes an angle of / with the trigonal axis, the transition field 
is given by 
H,=H,,/cos0, (18) 


because in this case x and y components of the field have no effect. However, for a 
powder specimen, we must be careful of the demagnetizing field. If we assume simply 
that each microcrystal is subject to an average demagnetizing field, which is determined 
by the total magnetization of the specimen, and an appropriate demagnetizing factor N,,, 
the transition field for those microcrystals, whose trigonal axis makes an angle of % with 


the applied field, will be given by 
H.=N,I(H,) +H! /cos, (19) 


where I(H,) is the average magnetization per c.c. of the powder specimen at the field 
H, and H,,’ is the transition field we would have for %=0 if there were no demagnetiz- 
ing field. The microcrystals with an angle of % contribute to I(H) a quantity 2M cos’ 
after the transition. So, if we denote the probability of their occurrence by n(%), [(H) 
will be given by 
= 
I(H) =2M | n (2) cos¥ (dit dH.) dH,, (20) 


Hoo’ 


neglecting the magnetization arising from the antiferromagnetic state. If we assume 
n() =sind, i.e., the random distribution of the easy axis of the microcrystals, (19) 
and (20) give the relation, 


I(H) =M{1—H,?/(H—N,I(H))}.. H>H,! (21) 


From (21) we see that the magnetization of the powder specimen is zero at H=H,’ 
(actually small), begins to increase linearly with increasing field beyond H,’, and then 
tends to saturate to M. In a single crystal this takes place at a higher field given by 
H.,.=H,,+N.M. This conclusion is, however, not decisive since the assumption of an 
average demagnetizing field might not be justified in the initial stage of the magnetization. 


§ 4. Comparison of the theory of the preceding section 
with experiments 


First we discuss the principal susceptibilities. 
In order to estimate the order of magnitude of %,, we assume |2|=100 cm=! and 
J.=2.1, the latter corresponding to JE,=1,000cm~!. Then (12) gives 


1) Wor 01023. (22) 


: Yomosa and Suzuki have discussed this temperature effect on the transition field, using the Bethe 
approximation as well as the molecular field approximation. (S. Yomosa and H. Suzuki, yet unpublished.) 
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Further, if we assume (L)=1 and y.=2.0, the sublattice magnetization will be 


M=(N, 42/2) (29. + ¢L)) =357 ¢ per c.c. (23) 
=13,970¢ per mol. 


If we assumed (L)=1.5, we would have obtained M=15,400¢ per mol. The (L) 
value and g-value will be discussed in later sections. Here we use (23) preliminarily for 
order estimation. 

Fig. 2 shows the experimental curve” of magnetization vs. field at 13.9°K, from 
which we estimate the transition field to be about 10,000¢. Using (23) and (14), 


we have 


(0) 10 20 30 0 10 20 30 0) 10 20 30 


Fig. 2. Experimental curves of magnetization vs. field of FeCl), CoCl; and NiCl, below 


their Néel temperatures.” 


H,,!/M=A— ®,, — (42/3) =10,000/357 =28.0 (24) 


and 
Ly moll Ty = 1.4. a 


In (25), we assumed N,=0. If we assume N,=47, we would obtain X 4 miot| 7 = 9-97. 
From (24) and %,.=8.37 (see (A-3)) we have A=40.6, so that in FeCl, the exchange 
interaction between the layers is larger than the magnetic dipole-dipole interaction between 


the layers. 
From (22) and (25) we obtain for the powder susceptibility 


Xs not Bey = 0.48. 


The experimental value of Zp moz|ry” 1 about 0.4. In the vicinity of the Neéel tempera- 
ture, the theory predicts that 7%, is much greater than 7) and therefore the powder 
susceptibility is approximately given by 7 /3, which is in accordance with the experimental 
result that 7, decreases rapidly with decreasing temperature. 


With the assumption of the Ising model and molecular field, the Néel temperature 


is calculated by 
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kT y= (N,/2) (29.4 (LY)? pe? (A404 9,.— Do) - 
Inserting Ty=24°K, we see that the relation (15) is satisfied and that (16) becomes 
OX<T/TyS0.95. 


We show in Fig. 3 the theoretical curve of magnetization vs. field at absolute zero as- 
suming N,=0. This assumption is not very serious, since we see from (23) that the 
contribution of the demagnetizing 
field is at most 47M=4500 4. The 
general behaviour of this curve is in 
accordance with the observation by 
Starr, Bitter and Kaufmann.” How- 
ever, the theory predicts about 
14,000 ¢ per mol for the saturation ge 
value of the magnetization, while 
the experiment gives about 20,000 ¢ 
per mol. This discrepancy may be 
due to the following. The experi- 0.2 

mental value was obtained after 

repeated magnetizations in a high me 

field in order to eliminate a hysteresis 

observed in the initial stage of the 

experiment. Since this hysteresis was 0 1.0 20 3.0 40 H’He 
due to change’ of ‘orientation ‘of’ the Fig. 3. Theoretical curve of magnetization vs. field of 
microcrystals on magnetization, the FeCl, at 0°K. 

assumption of the random orientation 

of microcrystals is inappropriate. Also, the assumed values of (L) and g. might be 
somewhat in errors. 

Recently Bizette et al." made magnetic measurements on a single crystal specimen. 
They found that the magnetization along the trigonal axis attains to a value of 36,5004 
per mol under a field of 24,0004 at 4°K and does not yet saturate. This is quite 
unintelligible theoretically, since even if we assumed the maximum possible value of 2 
for (L), the saturation magnetization would have been no greater than 33,500 4 per mol. 
The present theory seems also incompatible with their experiment in that the observed 
perpendicular susceptibility is larger by one order compared with (23). 


$5. Theory of the paramagnetic state 


We discuss the paramagnetic state on the basis of the following simplified one-ion 
Hamiltonian, 


H=2(L) 8.0, —2Jz(S) $+ tp H. (9.5. +(L) 0.) +142 92(H,S,4+H,S,), (26) 


where 2Jz is the coefficient of the molecular field effective in the paramagnetic state. We 


Magnetic Properties of the Tron-Group Anhydrous Chlorides 899 


neglected the dipole interaction in (26) because their effect is small. (S) is the average 
value of the spin which is to be determined self-consistently. We define m and m’ as 


quantum numbers of S. and o,. Then the susceptibility along the trigonal axis is given 


by 
i= (Nn /kT) (CL)? + 29. (LE) (nm!) + 92 Gm) 
+ CL)? (2Ja/RT ) (Xen! )?— Canty} / {1 — (2Ja/kT) my}, (27) 
where ¢ ) means the thermal average. <m) and (mm’) are given by 
{m?» = 2 sinh (x/2) (4 cosh 2x + coshx) /sinh (5x/2), (28) 
(mm!) =2 sinh (x/2) (2 sinh 2x+ sinhx) /sinh (5x/2), (29) 
with x=|A|(L)/kT. 
In a similar way, the perpendicular susceptibility is obtained as 
1 = (2N G2 pegt/RT) 9) / {1 (AIe/KT) FD} (30) 
with 
f(x) =sinhx sinh (x/2) (4 coshx+ 1) /x sinh (5x/2). (31) 


Since the population in each spin level changes with temperature, there is a specific 
heat anomaly of the Schottky type, which is given by 


AC = R{ (x/2)? cosech? (x/2) — (5x/2)? cosech? (5x/2)}. (2) 


§ 6. Comparison of the theory of the preceding section with 
experiment 


Neglecting the dipole interaction, we obtain the relations 
2Jz=2J,%—2]o% (33) 
and 


8], %/ (29. + (LY) fn = AM= 10,000 6. (34) 


The Néel temperature is defined as the temperature at which an infinitesimal antiferro- 


magnetic molecular field sets in. This condition gives 
kT y= (2), 2%, +2Jo%) (m) =r y° (35) 
Inserting Ty=24°K and assuming |4|(L) =100 cm~, we obtain from (28) and (35) 
iz +2)o%m=24 k/3.996 =8.30 X 10- ere, 


It might be remarked that for the perfect Ising model we obtain 2J,z,+2J,%=24 k/4 


and for the Heisenberg model 2]; z,+ 2J,%=24 k/2. Thus in FeCl, the large anisotropy 


energy enhances the Néel temperature twice as high as the value expected without it. 


From (33), (34) and (35) 2Jz is estimated as 


900 


2Jz=6.0X10-" erg 
or 3.0 cm? .* 


It is known experimentally 
that the powder susceptibility at 
room temperature follows the 


Curie-Weiss law” 
Ln mot=3:59 /(P-4A8)a (36) 


or according to older data obtain- 
ed by Honda and Ishiwara’” 


io ees SUT — Leaks 


Further, according to the data 
obtained at Leiden,’ the para- 
magnetic Curie temperature is 
equal to 20.4°. It is noted that 
the value of 48° for 4 appear- 
ing in (36) is twice as large as 
the Neel temperature. In the 
ordinary molecular field approxi- 
mation we expect that 0<Ty, 
because, as seen from (33) and 
(35), the coefficient of the 
molecular field which is effective 
in the paramagnetic state is 
smaller than that which is effec- 
tive in the antiferromagnetic state. 
Moreover, we have seen that in 
this case Ty is enhanced by 
virtue of the Ising model to 
twice as high as that expected 
for the Heisenberg model. Thus 
in the limit of kT>|A|, we 
expect 9< Ty/2. Actually, how- 
ever, the numerical calculation of 
(27) and (30) shows that an 
apparent high paramagnetic Curie 
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Fig. 4. Susceptibility of FeClo in paramagnetic region. 


curve (a): 


curve (b): 
curve (c): 
curve (d): 
curve (e): 
curve (f) : 
curve (g): 


curve (h): 


X%q calculated with (L)=1 and g,=2; 

%1 calculated with (L)=1 and g,=2.1; 
%/ calculated with (L)=1,5 and g.=2.025; 
x1’ calculated with (L)=1.5 and g,=2.04; 
(1/3) (%9 +2%4)3 

(1/3) (%97+2%1’) ; 

experimental curve after Starr, Bitter and 
Kaufmann ; 

(1/2) (Xi +% 1) (it coincides with (g)) ; 


(*) is the experimental values after Honda and Ishiwara. 


One might question whether this procedure of determining 2Jz is consistent with the theory of §3 
where the dipole interaction was taken into account. However, it can be shown that, even if we include the 
magnetic dipole interaction in the calculation, the correction is less than 3 per cent, 
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temperature results for the susceptibility near room temperature mainly due to the spin- 
orbit coupling energy, (6).* 


1 


If we assume, for instance, 4(L)=100cm=! and g,=2.0, we obtain for 7, near 


room temperature an approximate formula 

X%, =3.85/(T—85). (38) 
Further neglecting the exchange interaction, we have 

X%, =3.85/(T—80). (39) 
For %;, we obtain, assuming g,=2.1, JE,=1,000 cm™ and 4E,=8,000 cm™’, 

1 =3.88/(T+64.9). (40) 
The small temperature-independent susceptibility contributed from H,, (8), has been taken 
into account in (38) (39) and (40). 

The principal susceptibilities and the powder susceptibility for various values of the 
parameters appearing above have been calculated and shown in Fig. 4, together with the 
experimental powder susceptibility. From this calculation we see that, if we assume (L) 
=1.5, a satisfactory agreement is found with the observed susceptibility given by (36), 
whereas with (L)=1, we see no good agreement. This conclusion is to some extent 


indifferent to the molecular field approximation, since the results are insensitive to the 


choice of the value of 2Jz. However, the following experimental condition must be noted. 


In reference 2), it is stated that the AC 
cal des,—! mole} 


flake shaped microcrystals are not 
randomly orientated in the vessel 
but partially aligned putting their 
trigonal axis parallel to the direction 
of the applied field. Therefore, it 
is possible that the measured powder 
susceptibility is given, for instance, 
by (1/2) (%, +%,) instead of (1/3) 
(%, +2%,). If this were the case, 
the theoretical susceptibility with J 
Ua andy de 2 ages a api Fig. 5. The anomalous specific heat of FeCl in para- 
good sete nen with (36). On the magnetic region (calculated with the assumption of |A| 
other hand, the data by Honda and =100cm~! and (L)=1). 

Ishiwata, which are summarized by 

(37), are rather well represented by the curve calculated with (L)=1 and gy.=2. Al 
though the assumption of (L)=1 thus appears plausible, we cannot at present definitely 


eet 1 1 J sated 
0 100. 200 300 400 500 T K 


* If % is expanded in powers of 1/T, the coeficient of the 1/T? term is given by CO. Thus at high 
temperatures one would obtain @ by expanding (27) and (30) in powers of 1/T. We have in this way 
@=8|A|/9k=128°K (|A| is assumed to be 100cm~!) with (L)=1 and g,=2 for %\ and @=0 for 71. 
However, this procedure is not justified at room temperature as the series converge poorly. 
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determine from experimental results the (L) value and other parameter values. 

The calculated anomalous specific heat given by (32) is shown in Fig. 5. 

A little note will be added here as to FeCO,. The ion Fe** in this crystal is 
subject to a trigonal crystalline field, and this substance is suspected to be antiferromagnetic 
below 35°K. Foéx'” has found 6=60°K for 7, and O=—103°K for 7%, near room 
temperature, so that the behaviour of the principal susceptibilities is in qualitative agree- 
ment with (39) and (40). 


$7. Other layer crystals 


It has been found by neutron diffraction experiment’ that FeBr, has an antiferro- 
magnetic structure similar to that of FeCl,, though its crystal structure is of the Cdl, 
type in which the bromine ions form a hexagonal close-packed structure. The orientation 
of the magnetic moments is along the trigonal axis, suggesting the same orbital level 
scheme for the ferrous ions as in FeCl,. Recently Bizette et al.’ made magnetic measure- 
ments on the single crystal of this substance as well as that of Fel,. At their Neel 
temperatures (11°K for FeBr, and 10°K for Fel,) the susceptibility along the c-axis is 
much greater than the susceptibility 
perpendicular to the c-axis, as in \H) 

FeCl,.* No metamagnetic behaviour 

is found in both substances below 7! 

the Neel temperature. We expect i 
that the transition field H, for these 

substances is higher than that for 

FeCl,, since the antiferromagnetic 

interaction may be stronger in these 

substances due to smaller electro- 


negativities of the bromine and iodine 


2MX,, H 
ions. 

In CoCl, and also in NiCl,, Fig. 6. curve (a) : magnetization curve for field applied 
the anisotropy energy seems to favour within the plane, where the anisotropy energy in the 


plane is completely neglected. curve (b): the powder 


magnetization curve, taking account of the anisotropy 
axis, though we have so far no energy in and out of the plane. 


the plane perpendicular to the trigonal 


neutron diffraction investigation for 

NiCl,. The anisotropy energy within the plane may be so small as to permit the moments 
to turn in the plane and become perpendicular to the applied field even when the field 
is weak. With increasing field strength applied in the plane, the specimen will exihibit 
the perpendicular susceptibility up to a field strength of H,=2M/7,," at which the 
magnetization will reach ferromagnetic saturation (see Fig. 6 (a)). Since the effective 


* Tt is found, however, that x; is higher than the value given by (25) by one order. This discrepancy 
might be due to our inappropriate assumption of 4E,>>|4|. The detailed analysis without this assumption 
will be given in another paper. 
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field which turns the magnetic moments in the plane and brings them to ferromagnetic 
saturation will depend on the direction of the field, one will expect the magnetization of 
the powder specimen as characterized by an initial increase corresponding to a certain 
average susceptibility, followed by a certain range corresponding to ¥,, and then by a 
gradual approach to ferromagnetic saturation (Fig. 6 (b)).* The experimental results 
for the powder specimens of CoCl, and NiCl, below their Neel points are qualitatively 
accounted for by this picture. We shall discuss these two crystals a little more in detail 
in the following lines. 

In CoCl,, the orbital state of the free cobaltous ion, “F, is split into three sublevels 
by the cubic field, the lowest sublevel being triply degenerate, and this lowest sublevel is 
further split into a doublet and a singlet by the trigonal field. We assume that the 
singlet is lower than the doublet; this assumption is not unreasonable, since it can be 
shown (see App. III) that the same trigonal field as that which makes the doublet lie 
lower than the singlet in FeCl, gives the reversed level scheme for CoCl,. However, the 
energy separation between the singlet and the doublet may not be large compared with 
the spin-orbit coupling energy. The combined action of the spin-orbit coupling and the 
trigonal field will split the spin-levels belonging to the singlet into two Kramers’ doublets. 
Evidently, the separation between these two Kramers’ doublets decreases with increasing 
orbital separation; with an orbital separation of 1000 cm™', the separation of Kramers’ 
doublets will be about 100°K, and in the limit of vanishing orbital separation, it will 
be about 500°K. Thus we can safely assume that in the vicinity of the Neel tempera- 
ture (25°K) only the lowest Kramers’ doublet is populated. 

Confining ourselves to the lowest Kramers’ doublet, we can rewrite the Hamiltonian 
in terms of the effective spin operater s, of magnitude 1/2, to which the true spin, S, 
is related by 


S=a18, Says, wy =a s,- 
In our orbital level scheme, @, is smaller than a, and in the limit of large orbital 
separation, @,—=1 and a, =2.** Thus the exchange interaction, 2JS,-S,, (either intra- 
layer interaction with J<0 or inter-layer interaction with J>0) is rewritten by s, and 
2] {Oy Seg Seg H+ 1? (Sei Sap Saye (41) 
From (41) we see that the exchange energy is highly anisotropic in the lowest doublet, 


so that the spins orient perpendicularly to the c-axis in the antiferromagnetic state in 


* The initial average susceptibility will be given by 1/3(4)+%1+% 1c), where %) and %) are the 
ordinary parallel and perpendicular susceptibilities, corresponding to a field applied in the plane, and %\j< is 
the susceptibility for a field applied parallel to the c-axis; we expect %\, to be substantially smaller than % 1 
if there is a large out-of-plane anisotropy energy, as in the case of CoCl,, but otherwise %-=%1. The 
second stage of magnetization will correspond to (2/3)%1 +(1/3)%j¢ if Xy¢ is not equal to %|. 

** In the limit of large orbital separation, the lowest doublet is specified by S,==:1/2, as one will 
easily understand by considering that the effective Hamiltonian of the second-order energy of the spin-orbit 


coupling is given by DS”, with positive D. 
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which the spins in the same layer point parallel and those in adjacent pee antiparallel. 

The single crystal measurements by Bizette et al.” show that there is a large a 
tropy of the susceptibility, i. e., below and above the Néel temperature. In fact, 7%). ” 
about seven times larger than 7, below the Néel temperature. We can understand this 
large anisotropy by considering that in antiferromagnetic state 7), is related to a weak 
inter-layer antiferromagnetic interaction and 7,, mostly to the anisotropy of the strong 
intra-layer ferromagnetic interaction given by (41). Also, a possible anisotropy of the 
g-factor might play a role. 

The transition field to ferromagnetic state is given by H.=2M/7, when the field is 
applied within the plane. If we take 7, =0.4 after Bizette,” supposing his 7). to be 
%,, and assume 3/4, per ion for M, we obtain H,=—45,000¢ at absolute zero. With 
increasing temperature, H, will decrease as the sublattice magnetization will decrease. 
This prediction is in accordance with the observation on powder specimen” which shows 
that at 20.4°K the magnetization begins to saturate at about 28,000 é (see Fig. 2). For 
the field applied along the trigonal axis, H, is expected to be much higher. 

In NiCl, the anisotropy energy is expected to be smaller than in CoCl,. The single 
crystal data show no appreciable anisotropy of the susceptibility. This suggests that the 
anisotropy energy is much smaller than the inter-layer antiferromagnetic interaction whose 
order of magnitude is estimated from the relation A=1/7%, and is 2J),z,=7 cm. 
The main origin of the anisotropy energy must be the magnetic dipole interaction and 
the second order perturbation energy of the spin-orbit coupling, since the ground orbital 
state of Ni’* is non-degenerate already in cubic field. The dipolar interaction will favour 
the plane perpendicular to the trigonal axis, with an energy difference of about 0.5 cm™ 
between the axis and the plane. To determine the second part of the anisotropy energy, 
a detailed knowledge of the crystalline field would be required. 

If we apply a field, the sublattice magnetizations of NiCl, will turn in the basal 
plane and make an angle with each other. If the field, which is applied with an angle 
to the plane, is increased, the sublattice magnetization will come out of the plane and 
at a field of H,=2M/7, transition to ferromagnetic state will take place. Using Bizette’s 
value of ¥, ~ 0.1, we obtain H,=100,000 6 at absolute zero. The powder susceptibility 
data” are not inconsistent with these predictions. However, Bizette’s single crystal suscep- 
tibility shows a peculiar behaviour: it is field-dependent in the vicinity of the Neel 
temperature of 50°K, shows a minimum at 33°K, and below 33°K it ceases to be field- 
dependent. Both the powder and single crystal measurements give a positive paramagnetic 
Curie temperature, (, of 68°K” or 67°K” for the susceptibility near 300°K, which could 
be understood qualitatively if one remembers that the major exchange interactions function 
ferromagnetically between ions in the same layer. Since, however, 9 >T , the simple 
molecular field approximation which always predicts 0 < Ty, will not apply. 

Finally, it might be noted that the ferromagnetic interaction acting among the cations 
in the same hexagonal layer in all these chlorides can be understood by considering possible 
superexchange mechanisms. The author will deal with this topic in another paper. 


The author would like to express his sincere thanks to Professor T. Nagamiya for 
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Appendix I. Matrix elements of £ 
The matrix elements of L, with respect to ¢ defined in § 2 are abaned as 
(g1|L.|g1) =— (g2|L.|92) =1+ (3/2) (cos20— (1/3)) 
(b1|L.|91) = (62|Lelg2) =— (3/2) sin20 
(b1|L.|61) = — (62|L.|62) =1— (1 |L|y1), (A-1) 
other elements being equal to zero. The matrices of L, and L, have aa diagonal elements. 


Defining L¢=L,+iL,, the matrix elements of L* between the lowest doublet and other 


states are given by . 
(g2|L*|g1) =0, (s|L*|91) = (|L-|92) == 6 sind, 
(41|L*|g2) = (42|L-|g1) =2. : (A?) 


Appendix HI. Evaluation of 0, and @, 


First we discuss the approximate value of %,.. As defined in § 3, ,, is the coef- 
ficient of the inter-sublattice dipole interaction. In FeCl,, the layers of cations are well 
separated from each other by the layers of anions, so that we can assume safely that the 
magnetic dipoles in the different sublattice are continuously distributed in each layer. The 
continuous dipole layer which extends infinitely does not produce a magnetic field outside 


the layer. This means 
@,, M+ (42/3) M—4zM=0. 


Thus we obtain 


@,, =87 /3 =8.37. WOCA 3) 


Actually Suzuki’ obtained %,.=8.46 for MnBr, by a rigorous calculation. In MnBr,, 
the separation between the adjacent cation layers is 6.19 A and the distance between the 
nearest- neighbouring Mn ions within a layer is 3.82 A, while in FeCl, the corresponding 
distances are 5.85 A and 3.58A, the ratios coinciding within 1 per cent. Thus the 
error in (A-3) will be within 2 per cent. For %,, we.separate the contribution from 
the dipoles within the same layer from those from other layers. For the latter we obtain, 
by the same consideration, a value 87/3. The former has been calculated by Suzuki"? 

as —30.2 for MnBr,. Considering the difference of the lattice constants, we obtain the 


corresponding value for FeCl, as —30.3. Thus 9%, is obtained as 


@,,=—21.9. (A: 4) 
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The error in (A-4) will be smaller than that in (A-3). The x, y components of @ 
can be obtained from (A-3) and (A-4) through the relation, @,=9%,=— (1/2) 9.. 


Appendix HI. Calculation of the crystalline field 


According to the paramagnetic resonance experiment on NiCl,, the g-value of the 
nickelous ion is found” to be y=2.25. Since nickelous ion has a non-degenerate orbital 


level in the cubic field, its y-factor can be expressed approximately as 
g=2+8|4|/4E, (A-5) 


where JE is the separation between the ground orbital level and the first excited orbital 
level in the cubic field. From y=2.25 and (A-5) we see that the separation of the 
orbital levels due to the cubic field is of the order of 1 eV., or 10° cm (A=240 cm"). 

It is difficult to calculate the cubic field, the major part of which comes from the 
chlorine ions in contact (see ref. 4) for the discussion of the crystalline field). Therefore 
an attempt was made to calculate only the trigonal part of the crystalline field. We can 
infer from this part an approximate orbital level scheme, assuming that the splitting due 
to the cubic field is about 1 eV. 

In FeCl,, we may neglect the trigonal field arising from a small departure from the 
cubic arrangement of chlorine ions. The main trigonal field will be contributed from the 
surrounding cations and the induced polarizations of the chlorine ions. For the latter 
we assume a point dipole moment of magnitude p pointing in the direction of the c-axis. 
The potential arising from these sources can be expressed as 


V = (Cy t+ pC’) 7° Py (cost) + (Cup + pC’) 1* P, (cos #) 
+ (Cys + pCy,") r* Py (cos #) cos 3¢. (A-6) 
We calculated C,,, Cy’, etc., by the direct sum method.” The results are listed 
in Table I. 


To determine p, we calculate the electric field, E, at each chlorine site. This is 
expressed as 


E=E,+pE’, (A-7) 
so that 
p=a(E,+pE’) or p=aE,/(1—aE’), (A-8) 

where a is the polarizability of chlorine ion. Assuming a=2.97 A’? we obtain p= 
0.751 eA, where e is the elementary charge. 

Since the Hartree or Hartree-Fock function is not available for Fe?*, we calculated 
the average of 7° and r* using the Hartree-Fock function for Mn?* *» and also the Hartree 
function for Co**.”” The former gives (r?)=0.41 A® and <r") =0.375 A* and the latter 


<r peuiae® A® and (rt) =0.53 A‘. If we use alternatively the Slater function for Fe*, 
we obtain (r°)=0.90 A® and (r*)=1.31 A‘. The maximum point of the radial density 
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of d-electrons of Mn** is more distant from the nucleus in the Slater function than in 
the Hartree-Fock function. However, according to the result of neutron diffraction ex- 
periment for Mn*', the maximum point lies between the value given by the Slater func- 
tion and that given by the Hartree-Fock function. Considering these facts, we assumed 
arbitrarily (r?)=(r*) =0.45 (in A unit) in our calculation. 

We find that the potential determined in this way makes the orbital singlet lower 
than the doublet. The results for JE,, JE,, % and (L) for three values of p are listed 
in Table II. It is found that for p-values smaller than about 0.5 eA the doublet comes 
out to be lower than the singlet. This result does not depend sensitively on the choice 
of the values of (1°) and (r*), because the sign of C,,+pC,’ is mainly responsible to 
determine the sign of JE,. 

Smaller p-values are not unreasonable, since one would expect that the overlap of 
the charge clouds of the cation and anion will diminish the potential at the cation arising 
from the chlorine dipole moments. Also the repulsion between the electron clouds will 
tend to diminish the polarization of the chlorine ion. 

In the case of CoCl,, the orbital levels were calculated by using the same potential 
with p=0. In this way the desired orbital level scheme is obtained and the separation 
between the singlet and the doublet turns out to be about 1000cm™’. For certain large 


p-values, the level scheme is reversed. 


Table I 
Be eco hs ae Oe a a ee ae Seen Oa 
c. —0.007,26 e2/A5 Cw ! . —0,009,45 e/AS 
Cy 0.000,049 2/A5 | Ca | 9,001,033 e/AS 
E ——-9.418,9 ofA?” 7 a Ff ~ £9.221,2 1/A3 


pase ag a IE a er nc i ll eae A ae acc te 
used lattice constants: a=6.20 A, a=33°33’, u=0.25. 


Table II 
p=0 p=0.37 p=0.74 
; AE), (cm=) 9223 8017 7032 
AE, 2540 420 — 1589 
L 135) 1.162 0.892 
cos } 0.8853 0.849 0.794 
sin 0 0.465 0.529 0.608 
——— EE aa | CL EL LL 
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15) 
16) 
17) 
18) 
19) 
20) 
21) 
22) 
23) 
24) 
25) 
26) 
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It is shown that the special mass reversal MR(1) and MR(2) can be applied to both the weak 
Fermi type interaction and the weak Yukawa type interaction without any internal contradiction in 
the theory of MR(4) and MR(3) (for strong interactions). The selection rules obtained are con- 
sistent with experiment and suggest taking the universal Fermi interactions with V—A coupling as 
the primary interactions for all the lepton processes. The mass reversibility of the theory under 
MR(1) requires that the space parity is not conserved in all the weak interactions and that the mass 
of the neutrino must be exactly zero. MR(2) invariance allows only a linear combination of (AV) 
couplings when the coupling is independent of the explicit masses. It is assumed that leptons are 
conserved and are C or D type fields under mass reversal, guaranteeing the conservation of baryons 
automatically. From the requirement of MR(4) and the consistency of the theory, the neutrino is 
shown to be a mass parity doublet. It is shown from the application of mass reversal to the lowest 
order diagrams in g that the unwanted lepton processes are forbidden or explained even when the 
pry e~ and y are taken as the particles. The prediction of the rarity of x>e+yv, r>e+v+7 and 
K->e+y is given by extending the universal interaction proposed by Huang and Low in such a way 
that it is invariant under mass reversal. If this is true, the neutrino must have four components in 
a strict sense, and in the usual experiments it is observed as if it were a two component field. If 
MR(3) is assumed to be applied even to Fermi interactions, MR(3) requires uniquely that the uni- 
versal interactions have V—A coupling, and the helicity of the neutrino is —1. The decays K—>(z) 
+y+y/ are shown to be ruled out by the mass parity conjugation invariance and MR(1), provided 
the leptons are C or D type fields under mass reversal. The mass parity is shown to be related 
closely to the strangeness which is the important difference between our theory and Sakurai’s one. 


§ 1. Introduction 


It is pointed out in a previous paper” that mass reversal is a kind of spinor transfor- 
mation and is independent of charge conjugation, space reflection or time reversal. Thus 
special mass reversals (for example, MR(1) and MR(2)) can be considered consistently 
with usual mass reversal MR(4), which means that particles for which mass reversal is 
applied are in principle chosen arbitrarily. Hereafter we shall denote the operation of 
mass reversal simply by MR(n) where n is the number of particles whose masses are 


being reversed. The weak interactions of Fermi type are taken in this paper as the most 


*) A preliminary paper on Mass Reversal and Lepton processes was circulated in December, 1957, 
which was the revised version of the report in collaboration with K. Senba and M. Yonezawa which ap- 
peared in Soryushiron-Kenkyu (mimeographed circular in Japanese) 14 (1957), 618, And. See, also, T, 
Ouchi, K. Senba and Y. Yonezawa, Nuovo Cimento 8 (1958), 708. 

**) On leave of absence from Hiroshima University, 
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probable primary interactions responsible for the lepton processes. It is one of the main 
problems to examine whether these special mass reversals MR(1), MR(2) and the ae 
MR(4) can really provide the useful selection rules for the lepton processes in a consistent 
way with both strong and weak interactions in the baryon-meson system. 

An attempt very similar to these special mass reversals was recently proposed by 
Nishijima”? to derive the selection rules for lepton processes. However, we claim that 
the assumption of invariance under mass reversals MR(1) and MR(2) is less restrictive 
than Nishijima’s assumptions to produce essentially the same results. The meaning of 
mass reversal will be more clear and the assumptions in this paper will be justified once 
the structure of the elementary particles is established. 

The main results obtained by applying MR(1), MR(2) and MR(4) for leptons and 
baryons are the following : 

From the requirement of mass reversal MR(1) invariance, it is concluded that the 
space parity is not conserved in all the weak interactions and the mass of neutrinos must 
be exactly zero. Consequently the Fierz interference term in -decay and the magnetic 
moment of the neutrino must be exactly zero. From the MR(2) it is shown that a 
linear combination of (AV) couplings is allowed when the couplings do not include the 
explicit masses and that a linear combination of (STP) couplings is possible when the 
couplings include the explicit masses in the proper way (for example eq. (18) is the case.) 
From MR(4) and the consistency of the theory, the neutrino is shown to be a mass parity 
doublet. Then the leptons are assumed to be conserved and to be C or D type fields 
under mass reversal to guarantee the conservation of baryons. By taking the #7, e7, v 
and vy’ as the particles and the Fermi interactions with V — A couplings, Sachs’ assump- 
tion that all the lepton processes must involve at least one neutrino is essentially proved. 
The decay modes such as K-x+y'+ p>, K-xz+e*+e~ which are not forbidden by 
Konopinski-Mahmoud’s” procedure are forbidden from the requirement of MR(4). For 
the prediction of the rarity of the modes 7—>e-+y+ (7) the universal /—A interaction 
was modified so as to be invariant under mass reversal. The new terms needed by this 
modification are shown to have no contribution for those processes, so that these new 
interactions are reduced to Huang-Low’s" universal interaction. Consequently the 7—>e+ », 
etc., are predicted in the same way as Huang and Low. If this is taken seriously, the 
neutrino is described by a four component field in a strict sense and only the proton and 
electron are always left-hand polarized. This is probably due to the fact that both electron 
and proton are stable. MR(3) was defined to forbid the mode p—->e+e*-++e~ and seems 
to imply that the universal Fermi interactions have the V—A couplings and the helicity 
of the neutrino is —1 which is consistent with the latest experiment”. The decays K-> 
(7)+v-+y! are shown to be ruled out by the mass parity conjugation invariance and 
MR(1) in V—A theory provided the leptons are C or D type field under mass reversal. 

The most important points to be emphasized are that our mass reversal is applied 
both to strong and weak interactions and the mass parities are related closely to the stran- 
geness and that the weak interactions for baryon and meson systems are the universal 
Yukawa type interactions with gradient coupling. These are the essential differences 
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between our mass reversal and Sakurai-Hori-Wakasa’s”) theory. The comparison of the 
prediction of the universal /— A interactions was not presented in this paper in order to 


avoid the overlap with those papers by Feynman-Gell-Mann” and by Sudarshan-Marshak” 
and by Sakurai. 


§ 2. Outline of the theory of mass reversal 


In order to clarify the meaning of mass reversal, it will be useful to recall Nishijima’s 
theory” for leptons. He found that the following two sets of equations (I) and (II) 
yield the same S-matrix, but not the same Hamiltonian in quantum electrodynamics. 


He (0,— ieA,) +m} b=0 


ieee — —1e gp Vu i (1) 
Ve (0,= ieA,,) 4 merato\ wy ==() 
04,.= —1e (b Tp pb (II) 


(a: real constant), 


This fact is due to the special property of 7, that 7, can absorb the transformation factor 


e7 2eTs 


appearing in the physical quantities in such a way that e~‘*™ 7, e-™%=y,. Next 
he assumed that the two sets of equations above mentioned yield the same S-matrix even 
in the weak lepton processes, and derived several interesting results which seem to be 
consistent with experiment. 

Although the essential point of the Nishijima theory is that @ is arbitrary, we may 
fix a to 7/2. Then his attempt is equivalent to the concept of our mass reversal. This 
will be the most simple explanation of the mass reversal is less restrictive than his theory 
in the sense that @ is fixed. In fact the theory of mass reversal can be developed more 
generally in a similar way as space reflection. 

In g-number theory of mass reversal, we have shown in the previous paper” that mass 
reversal is equivalent to a kind of spinor transformation and, therefore, can be applied 
even to the neutrino with zero mass. The arguments can be formulated in the interac- 
tion representation as follows : 

The mass-reversibility is to assure for S-matrix the relation 


CfIS|D=Ch «Sle || «) 
mechs —8| 9 |=] 17 —#) 
=(f'1S'|#) (1) 
where i(i’) and f(f’) represent the initial and final states respectively and S$’ indicates 
that it is constructed from a set of equations of type (II) in the same way as S is 


made from set (I). If there exists a time-independent unitary operator M for mass reversal 


satisfying |i’ >=|1; —«> =Ml\i>, eq. (1) is translated into 


S’=MS[ « |M-* (2) 
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or in terms of interaction Hamiltonian H, 


W =H —«|=p,(MH (x) M~). (3) 


And in general for any physical quantities Q’s we have 
Q!=01 —K]=7 (MQ (©) M~) (4) 


where / is a sign function for @ under mass reversal. In usual case //, must be taken 
as+1 as H is energy. H(«) contains, in general, a certain number of explicit masses 
x, and fields (,(«). Then H(—«) has the same form as H(«), but with explicit masses 
—x, and fields ;(—x«) instead of ¢/;(«). Let us take as the most general interactions 
of Fermi type 

H=Sif.(¥" Oz $) (F On #*) (5) 


where the order of fermions is fixed for simplicity so that both a and 6 are baryons or 
leptons, and take into account the special mass reversal MR(1) and MR(2) together with 
the usual MR(4). This possibility was pointed out in the previous paper? from the 
physical meaning of mass reversal. As the double MR(2) is in a certain case equal to 
MR(4) and double MR(1) is in a certain case equivalent to MR(2)*, it will be a 
sufficient extension to assume the value of /,, for MR(2) and MR(1) to be (+1) and 
(+1, +1) respectively. Corresponding to this, eq. (1) should be modified slightly. The 
meaning of these assumptions will be clarified in the next section. 

As was shown in the previous paper'’, the operator M exists and the spinor fields 
(); the boson ¢; transform as follows : 


Moh; (x, K,) M—"=a, 7s $i (x, —*), 
Mo; (x, K;) M =u, 0; (x, —f.) 


(6) 


where «, is the relative phase of ¢/,; under mass reversal and takes the value + 1, +i and 
«wis the mass parity of bosons, taking the value +1. Now we can derive the selection 
tules from the requirements of mass reversibility of the Hamiltonian. Assuming that the 
fx do not involve any explicit mass, we get from eqs. (3), (5) and (6), 


Wq*O,Wc*Oa=1, from MR (4) (7a) 
to,*og=a with a=+1, from MR(2) (7b) 
where we take the-+sign for O, with V or 4 coupling and the—sign for O, with S, T 
or p coupling. 
O. 75 W2@=8O, with f=+1, +t from MR(1) (7c) 
As w, is limited to the value-+1 and-i, eq. (7c) means 


*) This situation will be more clearly understood from the following fact. The operator M for 
MR(4) is given by M=M,M,M,Mzy where M, is the solution of eq. (6) for M and the mass reversal. 
operator for MR(1) applied to the i-particle. Therefore the double MR(1), i. e. the product of MR(1) 
with M, and MR(1) with My, is equal to MR(2) with MjM,, and so on. 
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O,=0y (147) or O=0,'(1 =i7;) (8) 


where the case O,=O,'(1+i7;) is forbidden if the time reversal invariance is preserved. 

(7b) means that there occurs either a linear combination of S, T and P couplings 
or a linear combination of V and A couplings, which are the same results obtained by 
Tiomno”. (8) means that space parity is not conserved in the weak Fermi interactions. 

It will be noted that MR(2) is a special case of double MR(1) and should be 
applied to lepton pair or baryon pair, and that the relationship 8, (for STP couplings) 
=—/, (for AV couplings) is assumed in the following discussions because eg. (7c) 
shows that (7, will depend on the type of O,. 


§ 3. Application of various mass reversals to leptons 


Now we are ready to apply all kinds of mass reversal to Fermi type interactions 
responsible for lepton processes. As the form invariance of the theory under MR(1), 
MR(2) and MR(4) simultaneously is required, eq. (1) should be replaced correspondingly 
by 

[<fs «lSle]lis e)P=|Cfs—«|S[—«]lis —«)|? (9) 
where |---|? implies that one takes care of the average over the initial spin states and 
the summation over the final spin states. The above change is necessary because a and 
2 for MR(2) znd MR(1) are not always equal to +1. 

To assure the consistency of the theory of mass reversal, the mass parity assignment 
for hyperons and mesons is necessary. We have formerly'” proposed the form of the 
strong interactions and the universal weak interactions for the hyperon and meson system. As 
the most probable primary interactions of the Yukawa type for this system, we suggested 


Gd. 75 $y ¢, for the strong interactions (10) 


and 
I Pais’ 1x Yo be for the weak interactions, Cit) 


where G and g are assumed to be natural constants’? with the magnitude of order 1~10 
and 10-"%~107' in units of h=c=y (7 meson mass) similar to e for the electromagnetic 
interaction and 7,’ is taken to be either 7, or 1 if the space parity 1s conserved in the 
weak interactions, but all 7,’ can be equal to 1+7; if the space parity is not conserved 


in the weak interactions. Assigning the mass parity as follows : 
SN re And ‘eB eme= 1, erat, 
A,e+1. (12) 


where € means that =, N belong to the fields of type A’ under mass reversal, etc., we 


can derive good selection rules which are consistent with experiment. If we connect the 


5 9 ie a | ae ee A ae a 
mass parity with the attribute a 50 that w=(—1)* for fermions and w (—1) 


for bosons, our selection rule for hyperon and meson system reads from MR(3) with 


Pa=+1 in (3) 
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‘oO 
bo 


(a a Cee (13) 


where i and f indicate the initial and final states, the summation should be done over all 
the particles present in the initial and final states and n.W. means the number of vertices 
with weak coupling g in the Feynman diagram. From (13), we can derive the even-odd 


rule, 


da=0 (mod. 2), for the strong interactions 


da=1 (mod. 2), for the weak interactions. (14) 
But eq. (13) can be taken more naturally to imply that Sachs’ rule can be obtained 
if a’s are chosen in such a way that decay =-—-N+<77 is forbidden 
pa=n=nW (15) 


Although Sachs assumed that the transition having the larger n becomes much weaker, 
the strength of the transitions having dJa—n can be estimated in our theory to be of 
order y”, therefore very weak unless n=O or 1. This is the reason why there exist the 
strong and weak interactions observed in the hyperon-meson system.* 

In order to discuss all the weak interactions in a unified manner, the knowledge 
about a and /# is needed from the interactions established for the baryon and meson 
system. As is easily shown, the weak interactions (11) with 1+7, for 7,” are invariant 
under MR(1) and MR(2) with a=—1. Consequently we assume in general the follow- 
ing relations : 


a=-+1, when >) s,;=even, neutrino » is produced for the lepton process 
z 


a=—1, when >} s,;=odd, neutrino v’ is produced from the lepton process, 


; (16) 


where the summation runs over the baryons contained in the interaction and s represents 
the strangeness of the particle. (That the leptons have zero strangeness is assumed.) 
This will also be natural from the point of view that the strangeness of the nucleon is 
zero and the strangeness is related to the mass parity and that the strong interactions 
satisfy under MR(2) the similar relationship as (16) which has the opposite sign for a. 
Now let us apply MR(4), MR(2) and MR(1) to the Fermi interactions n—>pte+tp 
and A->p+e-+v/ as the typical examples. Then we have from eqs. (7a), (7b) and 


(7c), provided the coupling constants do not contain explicit masses, 


* * 


Wy" Wn OW.” Oy=1, w,* Wn We* Oy =1 (17a) 


and O,=7.(1+7;) for all the Fermi interactions (17b) 


where neutrino y in /?-decay is distinguished from the neutrino in A-decay. (17b) implies 
that the universal Fermi interaction is possible between baryon and lepton families and 


*) Similarly it is derived that the lepton processes having 4a=O are allowed and their strengths of 
the transition are of order g, provided that they proceed through the interactions (10) and (5). 
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between lepton and lepton families. And wy=—«y, is deduced from eqs. (12) and (17a). 

In order to obtain definite solutions for w’s from (17a) it is assumed that leptons 
are conserved and all the leptons belong to C or D type fields’ under mass reversal. Then 
Op=O.-=, €C and wy €D (or w.=a,=0, € D and wy, €C) is the solution and 
the conservation of baryons is preserved in the same way as treated by Yang and 
Tiomno’”. If we apply MR(1) to baryons in order to obtain the knowledge about / 
we find the relationship between 9 and w so that 2,=/), if w,=w, due to Of, Ieee 
This relation suggests indirectly that (16) was a good choice. By using this relation- 
ship, both » and »’ are shown to have the same helicity which means that neutrino 
is a mass parity doublet'?. This strange situation is not the defect of the present 
theory. Rather it originates only from the formal ambiguity of mass reversal for the 
neutrino due to its vanishing rest mass. 

It is easily shown by using the same knowledge about a and / such as (16), and 
8,(for AV couplings) =—, (for STP couplings), etc., that the following -decay in- 
teraction with the coupling constants involving the explicit masses is invariant under 


MR(1), MR(2) and MR(4): 


GP, Hp (L£7s) Pury) (Pe Te A £75) Prawn) +E is (i, Cai von) (C.F) Gon) 


Pp 


+ tensor coupling| ; (18) 


where m, and m, are the masses of electron and proton. This interaction implies that 
the proton and the electron are always left-hand (right-hand) polarized when the strong 
interactions are switched off and the small amount of right-handed (left-handed) 
neutrino can exist together with the normal left-handed (right-handed) neutrino or that 
the neutrino has four components in a strict sense but it is observed almost as the two- 
component particle”. Now we can prove from (9) that the mass of the neutrino must 
be exactly zero whenever this type of interaction is considered. Let us consider the inter- 


ference term corresponding to the Fierz term which is proportional to 


F, G"S,{ (rp tim.) Oy! (7'p' +im,) BO" B} (19) 
My 
where O,/=74(1+7;) O”=1747; and F,, is a vector made of nucleon wave functions. 
Then it turns out by a trivial calculation that this term is equal to 4iP,F,, m, m,/m, which 
is not invariant under MR(1) for the neutrinos unless m,=0. In the same way we can 
show that m,,=0. It should be pointed out that MR(1) invariance requires that the mass 
of the neuttino vis exactly zero, provided that right-handed and left-handed neutrinos can 
coexist. Such an example was given by Nishijima'” to forbid the unwanted processes in 
the lepton processes. His results are equivalent to taking the 4°, e~, v as the particles 
and assuming the conservation of leptons which was the original idea of Konopinski-Mahmoud. 
The main difficulties of their theory are that the decay modes emitting the / pair or 
electron pair such as Ka7+p°+/', K-»a+e*+e7 cannot be forbidden, and that the 


experiments haye shown that /”, e- and »y are particles. 
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¢4. Selection rule imposed by mass reversal 


In this section we will show that mass reversal invariance predicts why the unwanted 
decay modes are not observed in the lepton processes when muon (u-), electron (e~) 
and neutrinos (y and y’) are taken as particles as proved by experiment. In this paper 
we take as the primary interactions the strong interactions (10), the universal weak in- 
teractions (11) and (5) with V—A couplings and the electromagnetic interactions of 
the form as e ¢), 7, ¢, A, As a direct consequence of this, Pauli type electro-magnetic 
interactions having the form (e/2m,) Ji Fur Yq Fuy only, are allowed from the requirement 
of mass reversal. Thus the magnetic moment of the neutrino should be exactly zero 
because the neutrino has zero mass and the above form of interaction is impossible. The 
consideration of the lowest order diagrams with respect to the weak coupling constant 
g will be enough because #7 is very small. 

First of all we show that the possible decay modes such as K-x7+y° +p, Kor+ 
ette-, K>a+y-+ are forbidden. The decay of the K-particle is understood as a 
result of the virtual process in which the K becomes a baryon loop which decays into 
the muon pair or the electron pair, etc., by the following 


A>N+pt+ypr, A>N+e*+e7 and A>N+ y+ 0. (20) 


But these interactions are not invariant under MR(4) because the ~~, e~ and vy are C 
type fields and A and N are the B and A type fields respectively. For the same reason 
the decay mode K->pv+ 4, (4, is the neutral counter-part of muon) suggested by Marshak 
and Sudarshan” is forbidden. The mode K*—>z~-+-e* does not conserve the leptons and 
therefore is forbidden. The decay mode s—->e+7 can take place only in a diagram of 
the second order in y. Therefore this is hardly observed. The most important problem 
will be to explain the decay ratios o= (7—e +) /(7>+y) <10~° and o,= (te +47) 
/(@>p+y) <10~. This is solved in the same calculation as Huang and Low if we 
adopt the interaction (18) which is a slight modification of the /—A interactions. The 
difference between the interaction (18) and Huang-Low’s interaction is that scalar and 
tensor couplings are not included in the latter. And fortunately the scalar coupling has no 
contribution for those decays and the small amount of tensor coupling does not necessitate 
any change for our purpose as was shown by them. In the same way it is understood, 
provided K-baryon strong interaction have 7°, coupling, why the decay K*—>e*+y is not 
observed while K‘—>2°+e'+y is observed. Thus it seems to be a good assumption 
that the universal Fermi interactions have the form of (18) whenever both electron and 
proton take part in them. This was assumed for the present from the conjecture that the 
special interaction needed for the prediction of 7—>e+» might be closely related to the 
stability of the proton and electron. Then we can forbid the capture mode u- + popte. 
For this case the interaction of the form (18) will work and then the mass of the fis 
required to be zero from the MR(1). Unfortunately there exists no charged particle with 
zero mass. Thus it is impossible. We argue that the decay mode >e-+e* + e> will be 
forbidden for the following reasons: The mass reversal MR(1) for the electron is in this 
case just triple MR(1) which will be defined to be MR(3) for the Fermi interaction from 
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the same reasoning as that MR(2) was the special case of the double MR(1). The other 
MR (3) is known to be applied to the Yukawa type interactions and requires that o,= +1. 
Then it will be natural to take ¢,=-+1 for all MR(3). Due to this assumption the above 
process is forbidden because the e~ is a C-type field, therefore 7, -+i. To check this assump- 
tion, let us apply the same argument to the interaction (A O, N) (N O, N) in Sakata’s 
composite particle model" for baryons and mesons which corresponds to the decay /’—> 
N+7°. The result is that O, should be equal to 7,(1+7;), which means that the 
universal Fermi interaction has /—A coupling, and the helicity of neutrino is —1. These 
are consistent with experiment”, which shows that the definition of MR(3) is correct. 

Finally we will show that the decays K°->y-+-y’ and K->7+ 7-4’ are forbidden from 
the invariance by mass parity conjugation and mass reversal. The mass generacy of neu- 
trions v and »’ will strongly suggest the existence of mass parity conjugation operator C,, 
which is expected to have the same character as that introduced by Lee and Yang." 
Now let us assume the existence of C,, which obeys the conditions 


[C,, M|.=0 (21) 
and 
[Ceding |= 0 (22) 


where H,,,, denotes the interaction Hamiltonians. As is easily known, the condition (22) 
is not satisfied unless we take the suitably chosen interactions as H,,, in (22). There- 
fore we understand the condition (22) to select the interactions H,,,.. Then the follow- 
ing Fermi interactions corresponding to #—->e+y and A>N-+ y+’ only should be taken 


Ky oe Fes 
GB. Ox $n) Gu Ox th) + Gs Ox $y) (Gur Ou $s) 
and 
G(Py Os Pacey) (Pr Os Pr) +G (Pw On Pacer) (Yu On Hv) with O,=7,.(1 £75) 


The latter interactions are not invariant under MR(1) for v’ or » when the leptons are 
C or D type fields under mass reversal, which means that the condition (21) is not 
satisfied. Thus the decays K->(x)+v-+ »’ are impossible because they are supposed to 
proceed through these interactions. 

The other unwanted but unforbidden processes will compete with the observed modes 
and will be shown to be hardly observed. Thus we can say without further discussions 
that Sachs’ assumption that all the lepton processes must involve at least one neutrino 
is essentially proved. 

Though the order of the four fermions in the interactions is fixed so far, we can 
allow it to change and impose the same relationships as before. In this case, more 
stringent restrictions are obtained. The answer is that only V—A couplings”. S+P—T 
couplings and S+P-+ (T/3) couplings are possible when the universal interactions (18) 
are taken into account. We can give its proof for /?-decay assuming the original interac- 
tion to be of type (18), denoted shortly by S/'° (V5 Vy; ¢, ¢»). When we rearrange 
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the order to be 5’, ==S, (VY, gv3 Ge Py), this 5,/© is shown to be related to S 0%) 
as follows : 


ne = > lax} Dy 3 5)” ( a Py > be 4 oD) = te if Px) (92 qd £ is) is 9) (23) 
i 


where 


ra 


eo IH 
Oo NF BY 
CO KY CO BR 


(44) = 


rm bho BY 


— 
bole 


| (24) 
J 


| 


| 
Be RH O NH 
| 


= 
| 


4 


AW 
les 


+ 


If we apply MR(2) to both interactions, it is concluded that the (STP) coupling and 
the (V—A), S+P—T coupling in S,’ go into —(V—A), —(S+P—T) in S, and the 
S1P4+(T/3) couplings goes into S+P+(T/3) in S, in C-number theory. If it is 
required that the interchange mentioned above gives the minus sign to C-number universal 
interactions or matrix element, the decay mode #»>e+e +e is forbidden because this 
has the plus sign by the interchange. This is the other argument to rule out the mode 
fboe+e +e. 


§ 5. Concluding remarks 


The advantage of the theory of mass reversal over the Nishijima theory is that the 
various points are explained in a unified manner in the theory of mass reversal. For 
example, we need not assume the Lee-Yang neutrino to derive Sachs’ assumption that the 
lepton processes must involve at least one neutrino. And we have more freedom to deter- 
mine the coupling forms of Fermi interactions than in Nishijima’s theory. The parity 
nonconservation does not always mean in our theory that the observed neutrino should be 
described by a two component Dirac field because the interactions such as (18) are allowed. 
From the standpoint of MR(1), Fujii and Iwata have examined the polarization problem 
of the /-decay and se decay and they found that there exists a slight difference in the 
case of /7-decay between the two-component neutrino theory and ours. And it would be 
detect this difference when more precise experiments are done, although both theories so 
far explain the experiments well. 

As is easily understood, the special mass reversal (MR(1) can be applied to Sakata’s 
composite model for the baryons and mesons, and we can find that the space parity is 
conserved for the strong interactions, but it is violated for the weak interactions. In other 
words, all the Fermi interactions (including leptons) having only an even number of par- 
ticles with same masses in themselves are strong and the others are weak. One remarkable 
fact is that in the application of MR(1) the Yukawa-type strong interactions (10) and 
the conresponding Sakata theory give different results. This fact shows, therefore, that 
there is a great difference between the Fermi interactions and the Yukawa ones, and that 
the mass reversal MR(1) will be the most fundamental Operator among the various mass 
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teversals as far as the Fermi interactions are concerned, though the appropriate combination 
of the four fermions is required for the strong interactions. But the application of MR(1) 
was limited to the weak interactions in this paper because the charge independence sym- 
metry is working on the strong interactions so effectively that no room for the additional 
introduction of MR(1) is likely to be found. As pointed out by Sakurai, the mass 
teversal MR(1) seems to be possible due to the fact that fermions have two components 
when the strong interactions are switched off. But this paper suggests that such an inter- 
pretation for MR(1) will be true only in a good approximation. The problem remaining 
is to look for the true meaning of MR(1). 

Finally it will be added that the mass parity doublet for the neutrinos may imply 
that there are two kinds of the universal — A interactions with the slightly different 
coupling constants, one of which accompanies the neutrino » and the other accompanies 
the neutrino v’. There is the similar situation in the strong interactions that the K meson 
seems to interact with the baryons a little weaker than the pion does, which may be 
related with the fact that the mass parity of the K meson is opposite to that of the pion. 
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Several attempts have been presented to take account of terms corresponding to large graphs in 
the cluster expansion formulae for the free energy and the radial distribution function, for the purpose 
of applying them to a gas of not small density or to a liquid. In this paper is proposed an approxi- 
mation, which includes the approximations in the past as the first or second approximation, by taking 
account of graphs which can be easily evaluated by means of the Fourier transformation. 

The graphs taken into account in our approximation are all those which can be constructed start- 


ing from a line @—® or a ring cy by a sequence of the processes to replace a constituent line 


e—e by a watermelon SS ; in the course of the replacement, a line in a watermelon is also allowed 
to be replaced by a watermelon. The approximaion will be called the hyper-netted chain approximation. 

The formulae obtained in this approximation for the free energy and the radial distribution 
function are given by eqs. (29’) — (30%) and eq. (35), which demand the solution of a recurrence 
equation containing the Fourier transformations. 

The expansion formulae for the free energy and the radial distribution function by means of the 
“hyper-netted chains” are also presented. They contain the results in the hyper-netted chain approxi- 
mation as their leading terms. 

Another set of the formulae in the hyper-netted chain approximation is given and compared with 
the theories for ionic systems in the past. 

Applications to practical problems will be given in forthcoming papers. 


S$ 1. Introduction 


We consider a uniform fluid of N identical particles in a volume V and temperature 
T’ ; the potential of the system is assumed to be the sum of potential 6(r) between every | 
pair of particles. 

The free energy 4 of the system is expanded in the virial series as” 


A a es 
S ceatle, ) ii. Sei be aioe 
Tir Gea a Dae (1) 


where m is the mass of a particle, 4 Planck’s constant, k Boltzmann constant, p=N/V 
is the density and (7, the irreducible integral given by 


8,= 1 fonder SUITS (2) 


n'iV n+1St> jo 
Sum over all products which are 
more than singly connected. 
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where 

FP =f (ry) =expl—b(r) /AT]—1. (3) 
As Mayer did,” we express a term in the sum of the integrand of (2) by a graph 
consisted of numbered points, 1, 2, +++, n+-1, representing the number of particles, 1.2. 
‘-,n+1, and of lines connecting points i and j tepresenting the factors f,*” in the sum- 
mand. 

To obtain a physically reasonable results by the help of this expansion for the case 
of a fluid of not small density, it will be inevitable to evaluate £,, for large n suitably. 
For this purpose, some attempts to take some suitable terms into account have been 
presented. Montroll and Mayer” presented the ring approximation, in which the graphs 
of the form of a ring are considered besides the graph of a line: the others neglected. 
It is of course suspected whether in this approximation £,, for each n can be evaluated 
correctly even in the qualitative sense. It seems, however, to give correct physical results, 
for this approximation is known to be derived also from another more physically intuitive 
approximation valid also for a fluid of not small density.” When we start from the virial 
expansion formula, in order to improve the approximation we have only to take more 
graphs into account. On the one hand, the netted chain or ring approximation was 
presented,” which takes account of the graphs which are obtained from a ring by te- 
placing some of constituent line ---e—e--- by the: figure .e2s..., in addition. On the 
other hand, Abe” proposed the watermelon approximation, which takes into account all 
the graphs of chains which have two common end points, as shown in Fig. 1d, in addi- 


tion to those taken into account by Montroll and Mayer.” 


te © 


Fig. 1. a: second virial, b: ring, c: netted ring and d: watermelon approximation. 


a Le... 


Fig. 2. Rings of watermelon. 


Fig. 1 shows the typical graphs which have been taken into account in the approxi- 
mations mentioned above. Abe has suggested that the approximation will be improved 
by taking account, in addition, of the graphs of ring of watermelons, as shown in Fig. 2. 
It is to be noted that this approximation will take into account far more graphs than 
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those considered in the netted ring approximation. 

Thus, by generalizing the suggestion of Abe, we have a procedure to improve the 
approximation in a successive manner: We start from a line ¢—e as the zeroth order 
approximation, construct rings of line, construct watermelons by adding to it sequences of 
line e-e-ee between two points of a ring, and then construct rings of watermelon, and 
analogously we have watermelons of watermelon, rings of watermelon of watermelon,...... 
The purpose of the next section of this paper is to show that all these graphs can be 
taken into account for the formula of the free energy: this approximation will be called 
the “hyper-netted chain approximation”. Several simple graphs considered in our ap- 
proximation are’shown in Fig. 3 and those not considered in Fig. 4. 


eee ees 


Fae Pe ee 


Fig. 3. Graphs considered in the hyper-netted chain approximation. 


ho ees 


Fig. 4. Graphs not considered in the hyper-netted chain approximation. 


P the above we have considered the approximation for the case of the free energy 
n § 2, the formula for the free energy in our approximation will be derived, and in § 3 
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the formula for the radial distribution function in the same approximation will be obtained. 
In $5, we will discuss the possible methods to obtain analogous but somewhat different 
formulae for the free energy and the radial distribution function in the same approxima- 
tion and compare with the approximations presented in the past for ionic systems. In 
§ 4, we insert the formulae for the free energy and the radial distribution function, with- 
out approximation, in which the contributions of clusters are summed in the spirit of the 
hyper-netted chain approximation. 


$2. Free energy 


For the purpose of ordering the graphs we introduce the process of “ identification ”’. 
Now, let us call a point a “junction” when three or more lines meet at the point, 
following Abe.” Let us call two lines to be “ identifiable” when they have two common 
end points and both have no junction midway: For instance, in Fig. 5, two lines of a 


and four lines of 6 are identifiable, in c no 
identifiable lines are drawn, and in d upper ~--«2 S-- ee 
two lines only are identifiable. Then, we 


will call the process of replacing identifiable a 


> 


“* identification ”’. 


lines by a line the 


- 
~ 
~ 


c d 


ring © by a sequence of identifications.* The Fig. 5. 


other graphs will be considered in § 4. Here, - - - indicates a line or lines connected 
to another parts of the graph. 


By means of this terminology, the graphs 
considered in our approximation are all those 


which can be reduced to a line e—e or a 


We group the graphs to be considered 
by the times of identifications needed to reduce them to a line or a ring. The times 
of identification are measured by identifying all the identifiable lines in a graph at a 
time ; so that the graphs of C” in Fig. 3 are reduced to a line by one time of identifi- 
cation, those of Ry to a ring by one time of identification and those of D” to a line 
by two times of identification. We calculate the contribution to the free energy 


/ oo i) n ie ea 
— A ey 1 Cos eer ee \fos fared Sa. (4) 
NkT n=1 n+1 n=1 (n+ 1)! V n+1Si> 721 
Sum over all products which are 
more than singly connected. 


for each group separately, and add them up at last. 
We will start with graphs for which the times of identification are small and proceed 


to those with larger and larger times of identification. First, we consider the graph which 
: ee ad f ‘aa 
is a line at the zeroth time of identification. A line ¢—e, representing f,S”, will be 


denoted as B in the graphs in the following. The contribution of B to (4) is 


* As a ring can be reduced further to a line by another identification, these may be said to be all 


gtaphs which can be reduced to a line by a sequence of identifications. 
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= 4) = 0 ((ar,dr,f% (4) =F), (5) 
( recat Rete Nake 
where F (Kk) is the Fourier transform of f(r) : in general, we put 
F™(k)= \ar erg (F) (6) 


for f(r) defined by (3) and f(r) with n=1 to be defined in the following. 


Fig. 6. Ry—rings of B. 


Next, we consider the graphs of ring, the total of which, shown in Fig. 6, will be 
called R,,—rings of B. The contribution of R, to (4) is 


! © ont c r : > ‘0 
(- A ) = Sa cpry \\ fee enh oF (725) -«-*f (T4131) 
Rp =? ! J 


ae AS! 


ar . it 2 p" FO (k) n 
f r=o 


= t hee — oF —oF™ —ie@F 2\. 
Fai 2 | APF (ie) — pF (hk) BF FC} 5 (7) 


as has been evaluated by Montroll and Mayer. 
Next, we consider the graphs which become a line by one time of identification. 
Now let us call each of the sequences of B shown in Fig. 7 By, and define the groups 


Bl OMB guIaipineD, SMABOW, OR 
o—e—e© e—e—e—e ee ee 


Fig. 7. Bs—sequences of B. 


B, 0 Bg 2 (b)a (Ps 


qc 
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Fig. 8. C, C’, C’’-—watermelons of B. 
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of graphs C, C’ and C” as in Fig. 8; B appears besides By because one and only one 
B is allowed to exist between a pair of points. The contribution of C’” to (4) is 


/ 
(-4) -+=>- 0E (ky) | (hy) H® (hy) 
< NkT fo ONG r 1 a 


J 1 (0) (0) ( 
+ eH (ky) H (hy) H (he) + | 


1 ( (0) 
+o] 31V? H ” (k,) H (hee) bie Ghee) 


+p Hh) H (k) H (ey) H (ey + || (8) 
=| ar fof] 2 re) 2 eye | 

+o] 29+ 2 b+ || 
Sas peeke s 


(8’) 
where 
( ) ()? 
H(k) =pF® (k)2-+ 2F® (k)3-4 oe = OE)? 
ge Cart os 1— pF (k) (9) 
and its Fourier transform 
ber => e~t-r FT (fp) (10) 
fe 


represents the total of the propagation lines of B,* ; as has been obtained by Abe. This 
calculation has been performed by considering that : 


: 1) The power of ¢ is equal to the 
number of points in the graph minus one. 2) We may transform the integration by the 


coordinate x of the upper end point of a graph of C” as 


= {dx | dx, | dx, | de. 7 a 9 


ky Io 2 


” 


* By “ propagation line”, we mean a graph which consists of lines, watermelons, «--, with two end 
points and is connected with another parts of the graph by these two end points alone, and by “propagation”, 
we imply the factor by which it contributes to (4), 
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ee ee, thy ay ars, etn f (x +++) fo(%., oo) fi (Eps ++), (11) 
———— eee eé G ; ? 2 
yu 2 2 2 } 1 I : 
uk,=0 

then we can integrate over the coordinates of the midway points by a comes as (8). 
3) Factors 1/2 and 1/2!, 1/3!,---are needed because of the fact that, if all possible 
graphs in Fig. 7 are inserted in B,’s in C’’ and the numbers of particles are pena 
in (n+1)! ways, then 2-2!, 2-3!,---identical graphs appear. The details of this calcu- 
lation are given in Abe’s paper. . 

Next, we consider the graphs which reduce to a ring by one time of a 
These graphs, to be referred to as K,~—tings of C, are the total of those graphs in Fig. 9 


es: 
J [| 
¢ GC 


Fig. 9. Re*—rings of C. 


for which at least two edges are occupied by a propagation of C’. That is, Re is R-*, 
the total of Fig. 9, minus the graphs for which all C’s are equal to B and the graphs 
for which one of C is C’ and all others are B. At first we calculate the contribution 


to (4) of Ran 


A’ \ 1 oo 1 ; 
= — Pe oe 1 (k)" (12) 
( NkT he 2eV > n=3 Nn 


=] - In pF?) pF) 2 FP (R)| a2) 
2eV k } 


where f(r) and. its Fourier transform F\')(k) represent the propagation by the whole 
graphs of C in Fig. 8: 


= at 0) n — 1 (0) mn 
fOr) = £07) a ap ‘(r) rh Qasr he (r) (13) 
=f (7) MOO 4 eal) _ 1 _ f (p) , (13’) 


To obtain eq. (12) with (13), we have taken the following procedure. First, we consider 
the contribution of a graph of the type just considering. We integrate first over all the 
coordinates except two end points for every watermelon constituting the ring. Then we 
have a term in the summand of (13) for each edge or watermelon of the ring, having 
the distance between the two end points of it as r. Then, we can easily integrate over 
the coordinates of vertices of the ring by an aid of the convolution. We have then only 
to sum the contributions of all possible graphs just in consideration, where cares must be 
taken not to add the contribution of a graph many times. The coefficient can be evaluated 


a= 
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as follows. First, we construct all possible graphs by inserting one of C’s in each edge 
of the ring of C in Fig. 9 independently of the occupation of the other edges and, in 
turn, one of B, in each line of C independently of the occupation of the other lines ; 
and next we take all possible permutation of the numeration of particles in the graphs, 
then we have (number of particles) ! terms from the graph above constructed. There 
will appear many graphs indistinguishable with each other not only in the form but also 
in the numbers of molecules.. To obtain the contribution to (4), we take this into 
account by introducing the factor 1/2n in (12) for the rotational and reflective symmetries 
of the ring and the factors 1/2!, 1/3!,---in (13) for the invariance by the interchange 
of 2, 3,---lines of B, in the watermelon C of the edges. 
In eq. (12) we divide F\(k) into two parts due to B and C’ as 


PO (kh) SPO (k)-+ an (ky, (14) 


then the term of the zeroth order in JF“ (k) represents the contribution to (— A’ /N&T) pox 
from the graphs for which C’s are all equal to B and the term of the first order in 
4F(k) from the graphs for which one C is C’ and all others are B. The former is 
equal to (— A’/NkT),,, (cf. eq. (7)) and the latter is obtained as 


= Fk)" AF (k) = © S11 (k) dF (k) 
Perro) (k) = SH 


as, H® (k)[F® (k) —F® (ky|= © \ dr rp (rf rs + G3) 
2V Uk 2 
where the relation 4 (r) =h\?(—r) has been used. As the result we have 
eG ae 
WEB Ay piclvolaNil 1a,* NET /n, 


—£ [arb (rf) -F@)]. a 


With the aid of eq. (13), (8’) can be rewritten as 


A! =a p? (1) (0) (0) (0) r 
— 4) =£ [Fey Foy} 2 | def) 4) 


[deo (ry’ (8) 
4 


Next, we proceed to the graphs which become a line by two times of identification. 
Now we construct the sequence C,* of C as in Fig. 10. Then we define Cs by Cs= 
C.*—B,. Then we construct the watermelons of C and Cy as in Fig: 11. The graphs 
of D” are the graphs to be considered here. The contribution to (4) is obtained as 


Cc G Cc c Cc Cc Cc G ¢ 


Fig. 10. Cs*—sequences of C. Cs=Cs*—Bs, 
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Py 
ar ee D 
Fig. 11. D, D/, D//—watermelons of C. 
‘= Zo ) anak | dr f(r) 1 — dh (r) | 
w UNAT 7 ag esa 


+ 4%) _ 1 — dh (r) —5; [4h r(r)Fl, (17) 


just like when the contribution of C” was obtained, where Jh"(r) and its Fourier 
transform of JH (k), defined by 


4H (k) =H (k) —H (k) 


(18) 
are the propagation by C,, H (k) being the propagation by C.*: 
sel (3 
H®(k) =" _A*) 19 
(a (19) 


Next, we calculate the contributions of R», the rings of D with at least two D’s 
equal to D’. Just like for the case of Ro, we calculate it by subtracting from 


D 
Par yorbcll i 
D D 


Fig. 12. Rp*—rings of D. 
(— A’ ) = lyss u o” F° (k)” 
NkT Rp* 2eV FS ant : 


n 


pani Jows 


| 1—oF® OZ (2) eee ae 2 
ne] 3) —In pF (k)) pF (R)—3 er FOCK)}, (20) 
where F® (k) and f(r) represent the propagation by D: 

fOr) =FO (HY) eA 4 CMM) _ 1 — AH (r), 


(21) 
the terms of the zeroth order of F (k) —F\(k), equal to (—A’/NkT), , and the 
term of the first order, equal to ’ 
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re HO (k) [FO (k) — ONE Pa jars my) (Rear) =O Cr) | (22) 
As the result, we have 
Ad =A ee Oe ee ie 
Gs NkT We NET /2p* ( NkT i 
(23) 


a \ar°@)[F@) —f(r)]. 


The contribution of D” to (4) can be rewritten as, by substituting (21) in (17), 


(—4_-) =-£ [F®@) —F®(0)] 


NkT 2 
—L\ arg (PyAbe(r) = £\ dr[ an (r) (17’) 
2 
and so forth. 
In this way, the contribution of the graphs B’”——B”=B, BY =C”, B® =D"...- 
——which are reduced to a line by n times of identification is obtained as 
= ERAS Foe 0 Sy By) 0 
( 7 [F(0) (0) 


£{ dr [Ab (r) (24) 


: =-2 | ange (r) Abe? (r) = 


where F‘-” (k) =0, JA‘? (r) =h—” (r) The contribution of the graph R°’——R” 
=R,, R”°=Re, R®=R,,---——which reduce to a ring by n times of identification, is 


ial = Bie oe 
( ) i Gal NkT Rin-1)* 


\ NkT Rin) \ Rom* 
—-£ | drier) [fF (r) pr) (25) 
Z 
with 
ey 13} {—In(1—pF (k)) — pF (k) — 3A FO (B)*} (26) 
ee ED Ro * i oF ke 
Summing these up to B“ and R“”, we have 
= A! ) =! FQ) 
NET /3_(BOFR) 2 
+2 S3{=In(—pF  ()) pF () —4 7 FO ()*| 
20V 
(27) 


1) n—1 
= 2 fdr f(r) fo 7) —E I de [dh YP 
g 4 s=0 
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a n+1) 
Summing these up to R” and B“"*?, we have 


pene. He ae 2 F“*(Q) 
\ NRT 73) (BO) +R) + BOD 2 


: 5} {In (1 — pF (k)) —pF (k) -3 7 F (k)*t 
20V k& \ 


p 


; | dr S2{ th (r) F. (28) 


= ol dng (r) b (r) — 


In the limit of n—>co, these both tend to 


f ) 


F (0) +S} —In (1 — pF (k)) —pF (k) —4, °F (k)*t 
2 20V ke 


\ a HNO 


with 


4 (r) =limf™(r), 4 (r) =limh™ (r), F (k) =limF™ (k) 


n> co mo n>c 


fee (r) =f (r) PV AGIECD) en a =e dh (r) | 
Ab (r) =h (r) — bY (r) 


Am (r) =) sect HO (k) 
ae (30) 

e (n) 
H (k) =—CE™ BY" 
— pF (k) 


Fo (k) =5 7 ether £m) (r) 


f® (r) = eR )IAT __ y, Ad (r) =, 


When the potential d(r) is a function of the distance r=|r| alone, then so are f°?) 
and A" (r), and F™(k) and H™(k) are functions of k=|k| alone. Then the above 


set of formulae can be written as 


47 0 


Paar a FOO) + gh [Pet —ln.—0¥ 0 F Yo oF (b)'| 


oO 


—27p | rdr/(r) 4(r) —z0 - ‘dr iL 4b (r) (29’) 


with 
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4 (ry) =limf™ (7), 4 (7) =limh™ (7), F (k) = lim F™ (fh) 


n> oo n-> co 


if ash (r) =a (r) 04h) (7) he eAnin) G2 i= yA) (r) | 
dh™ (r) —fh™ (r) _ foe) (@) | 


1 
AD) =e | 
(7) Prosi 
a, ) (30) 
H®” (k) = Lope pads | 
— pF (k) | 
F (k) =2— eat rdr sin kr f™ (r) 


f(D) =f (—1) =e FO 1, HPO =O. , 


Thus to get the numerical results in our approximation we have only to repeat the 
Fourier transformations starting from a given f\ (r) =exp(—d(r) /kT) —1. 


§ 3. Radial distribution function 


In the previous section, we have developed the hyper-netted chain approximation for 
the case of the free energy, from which all thermodynamical quantities can be derived. 
While, when we know the radial distribution function g(r) we can calculate the pressure 
p and the internal enery E directly by 


p 4O 

pk 7 ne iv (7) pus a 
E i 

per 1+ er | 160 Soyanrtar, oe. 


The virial expansion formula for the radial distribution function g(r) is given by?” 


I, >) epee setaee +t +S) Ha. (r, ; ry) pv ga (33) 
et, 
Ham (rs, #2) =(1/m!) Vo (drys ms ST ff G4) 
“ ome 
2SeneS1 
Sum over all connected products for which each 
molecule of the set {r3, °°, Tm42} is connected 


to r, and ry by an independent path. 


In this case also, we can group the graphs by the times of identification needed to 
reduce them to a line 2_2. In Fig. 13, C* contains the graphs of one time and two 
times, D* those up to three times, and so on. The graphs which cannot be reduced to 
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° 
} > De. 
° 
Ct 
9 fe 
Cx c* e ic ¢ Cs ad >  & 
O 0 
D* 
fe. 
D* D* Dy D* () Ds ZOD D. 
O 
E* 


Fig. 13. C*, D*, E*,--- 


a line by identifications will be considered in § 4. 
Taking the graphs C* only into account in (34), we have* 


ly (7) ] c*k= e &&) [eT +h (x) : 
Taking the graphs D* into account, we have 


[9(r)] . — eB (r) /ATHAO (7) +R Yi —¢ SY. ETHAD (FH) 
‘ Yi ae = 


In the approximation of E*, we have 


2) 


[9 (r) pee POTD) 


> 


and so forth. 


As the result, we have at last 


LY (7) huxc=exp| —6(r) /AT +4 (7) |]; (35) 
4 (r) is the solution of (30’). 
That this formula for g(r) is in the same approximation as the formula for the 
free energy in the previous section can be confirmed by taking in (2) only the graphs 


considered in the previous section when we get (34) from 4 of (2) or (4) by the 
formula due to Bogolyubov” and Hiroike® : 


2V OA 


(OS NE BO 


*“ After the completion of the manuscript of this paper, the author was noticed that Meeron!)> 14) had 
presented some papers which are related to the summation of the cluster integrals for the distribution functions 
and potentials of average force. His result in ref. 14) is identical to ours in this approximation C*, with 


corrections in terms of 4, fi, and f™ in our notation. His result for g(r) in ref. 13) for ionic solutions 
is also in an analogous approximation, 
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If we follow the line of Meeron™ and are to calculate the potential of average force 
by means of the formula 


Ings). = Sa auLE at See r.) (3.35) 
Ket T3)= (1/m!) \\ a | drys mse TT ff, (34) 
2>x>1 


Sum over all products for which each molecule 
of the set {rs, +++, 7,49} is connected to r, and 
ry by an independent path and also the molecules 
{rs °-*, Tm4o} are connected to each other even 
when we erase the lines ending at r; and ro. 


the graphs to be considered to derive eq. (35) are obviously the total of 


‘B, Nice )p, EA ; 


} (a 


$4. Expansion formulae for the free energy and the radial distribution 
function by means of the hyper-netted chains 


In the preceding sections, we have considered the graphs which are reduced to a 
line or a ring by a sequence of identifications. In this section, we consider the remain- 
ing graphs. We group the latters by the graphs of junctions which are obtained from 
them after a sequence of identifications is applied until they come to have no identifiable 
lines and then the points other than junctions are neglected. Then we calculate the 
contributions of these groups to (4) and (33) with (34) along the line used in the 
previous sections and obtain for the formulae for the free energy and the radial distribu- 


tion function, without approximation, as follows : 


sent! pees) 
NkT \ NkT /HNC, given by eq. (297) 
co n—1 
tne \\- dr arg SB ID Chak) (36) 
n=i n!V n>t> joi 


Sum over all products which consists of junctions 
alone and are more than singly connected. 


Iz) =(9 (12) |HNC, given by eq. (35) 


1 33 [ff dry drs SM tes) fatto}, — G7) 


m=2 m! . m+2>%t>j>s 
m+2Sho3 
2>n>1 


Sum over all products for which each molecule 
of the set {1r3, -**, 49} is a junction and is 
connected to 7; and ry by an independent path. 
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where 
pstteg=L (%) +4 (1,3) > 


/ (r) and 4 (r) are the solutions of eqs. (30’) and represent the ee BD the 
graphs, Z and Zs, which are the limits of the series B, C, D, -- and Bs, Po # mety 
respectively. The graphs corresponding to the products of the agg terms of eq. 
(36) and of eq. (37) are shown in Fig. 14 and in Fig. 15, respectively, for small n. 


Fig. 14. A line e—< represents the hyper-netted chain, Z+Zs: the limit of the series, 
B+Bs, C+Csz, sae 


ee gee ee 


Fig. 15. A line e—e represents the hyper-netted chain, Z+Zs. 


It is to be noted that eq. (37) can be written also as: (cf. eq. (33’) with (34’)) 


Ing (ry) =— fe) Ere a 
‘i S) as 
m=2 m! 


e : 
’ : , : 
\fos-|drgss-drmee SM Aste) An bhen) 
vd os m+2Di> jos 
m+2oko5s 
2>x>1 
Sum over all products for which each molecule of the 
set {rs, °°, Faso} is a junction and is connected to r; 
and ry by an independent path and also the molecules 
{hs, °*'y Tymaoh are connected with each other even when 
we erase the lines ending at r,; and ry. 


$5. Another possible expansion schemes 


We have, in the preceding sections, obtained the formulae for the free energy and 
the radial distribution function in the hyper-netted chain approximation by grouping the 
graphs consisting of lines corresponding to f;\”, according to the times of identification 
néeded to reduce them to a line or a ring. When @(r) has the Fourier transform, we 
can do the same thing by first, expanding f\(r) in (4) in powers of —d(r) /kT, then 
corresponding a line to a factor —d(r)/kT and again grouping the graphs by the times 
of identification needed to reduce them to a line or a ring; which leads to another 
recurrence equations in the same approximation. 


Moreover, we can do the same thing by 
dividing the potential into two parts as 
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(7) =0,(7) +4(7), 


935: 


(38) 


@.{r) being assumed to have the Fourier transform, then introducing f,(r) and j(r) by 


eT PLA __ il = (r) , — 2200). 3 i(r) 


and expanding as 


—s(r i = Le nm = ys n 
y Os be 8 paar A) PRS, oi d 


(39) 


(40) 


then corresponding a line to f,(r) and j(r) and again grouping the graphs by the times 
of identification needed to reduce them to a line or a ring. With this expansion, what 
we are to evaluate are (1) with (2) or (4) and (33) with (34), with f(r) replaced 
by the right-hand side of (40). The merit of this method is in that the first Fourier 


transformation may be done analytically ; while the second, third, --- 


will have to be performed in the numerical way. 


Fourier transformations 


The contributions to the free energy are calculated seperately for each group of the 


graphs grouped as in Fig. 16, as follows: 


O 


(- = Leo 


with 


fOM=AO4+j/@.- 

1 
( cir ls 20V 
A’ ; ; ha take r)2 
re el aio 


oP yer 200) |= Ol dr fi [4) +) 


3) |= In (1 pF ()) pF) 3 6 


— $f dr[o°) HOF 


=£ [F(0) —F (0) ]—“_| def) (0) — 
Z 
with 
pF (k)? 
H™(k eS 
( )= — pF (k) 
and 


ree =F (ir) OG) +9) 4 AO 450) _ 1 — fF (7) , 


Ce 


jars (r)2 


(41) 


(42) 


(43) 


(44) 


(45) 


(46) 
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" ; Ae 
G ae eae - pale? | deh MEF? f°], (47) 


and so forth. As the result, if we make the correspondence B'”’ =6, BY =b/ +c", B” 
=" ---and RO=R,, RV=R,, RY = Ka, >, we have eqs. (24)—(28) with the recur- 


rence equations : 


6 
Oy Gente eee Aan 8 
b 
v_--—- ————qX« ,—-—“- ————— 
6 b’ R 
b 5 6 b b b b b 6 


iS Cc < c x Ps 
¢ < Gee ue R is rings of ¢ with 


at least two c’s equal to ¢, 


(s=cF—b,. 


i : b/: . + , 
Fig. 16. b; b > Ry bs, cf? Cc e, Ro. Kos os, CS» d’?s d, d’,--- 
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) (n) 2 i\ 
H®™ (k) = pF (k) | 
1— pF (k) | 

| ee (r) af (r) edn) ) a8 eAnlr) OL ee Ap (r) Jn = 1; | (48) 


jay (r) =f, (r) eh) t9) 4 er IC) KO) (7) | 


FROBEG) ai Oss 
F™ (k) and H”(k) being the Fourier transforms of f(r) and 4” (r), respectively. 
As before, we put 


f(r) =lim f (7), 4 (r) =limh™ (7), FA(E =lim F™ (8). (48’) 


Then, the free energy in the hyper-netted chain approximation is given by eq. (29) 
again. 


We get for the radial distribution function 
g(r) =exp| —d(r) /kT + 4 (r) | (49) 


in the analogous way as in the previous section.* 

To compare with the results obtained by various authors in the past for fluids of 
particles interacting by the Coulomb potential with and without hard core in a compensat- 
ing field, we write the results in the ring and the watermelon approximation. In the 


ring approximation, 


ea i ons F0) +" | drfin jo + | dr jor) 
\ NkT btb/+Ry eZ ae Aye 
+1 _s3{—In(1—pF (&) —pF °® —3 °F}. (50) 
20V k. 


In the watermelon approximation*** 


\ 


A’ ( A’ 
ae TU We NkT Ma 


“8 ENS AG |Episkt= Shon Coker N 


4 gn ©) +5) __ 1 — fo (r) —j(r) =e 4 Ca? (ay) +j(r) +t (51) 


=f (0) + _33{—In(1—pF(®) pF @) — BF} 
D 20V hk \ 


c 0 


Repro! 


dr h (r)2, (51’) 


? 
2 Aa 


* The graphs which contribute to g(r) are represented by Fig. 13 if we replace the large letters B, 
C, D, ---, by the small letters 6, c, d, --- 
ia ee that, when f;(r) =0, the third term of the right-hand side cancels with the sum of the last 


ahigehs ae that, when f,(r) =0, the 5 (r) +j(7) is the cut-off Coulombic potential if j(r) is Coulombic. 
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The ring approximation without hard core, eq. (50) with f,(r) =0, is equal to Debye- 
Hiickel’s result as was shown by Mayer,” which was also obtained by Zubarev” by the 
use of collective variables. Mayer” took account of the terms up to the first power of 
fi(r) in eq. (51), and Haga” considered some terms in addition. Yukhnovsky’” has 
given the formulae in the watermelon approximation up to the first order of f,(r) with 
some additional terms representing a triangle of watermelons, by the use of the collective 
variables. It is to be noted that our formulae have been obtained for one-component 


system, while the theories compared’*” are valid also for multi-component systems. 


§ 6. Conclusion 


We have thus found the possibility to compute the free energy and the radial 
distribution function of fluids, in the hyper-netted chain approximation, with account of 
far more graphs than those considered in the approximations in the past. 

‘The numerical computations for fluids of particles with Lennard-Jones potential and 
those of hard spheres and the extension of our method of approximation to the mullti- 
component systems will be given in the following papers. 
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The meson production by mesons in the low-energy limit has been re-examined in the frame of 
static model of the extended source theory. Using Feynman’s technique the calculation has been 
performed up to fifth order in the coupling constant. The renormalization procedure is the same as 
developed and applied by Chew in mesor-nucleon scattering processes. To reduce the Feynman 
diagram an approximation similar to that of the Tamm-Dancoff type has been used. The total cross 
section for charged and neutral pions from the pion-nucleon collision in T=3/2, J=3/2 state are 
obtained for incident meson energy 260 to 570 Mev. The theoretical results have been compared 
with those of Franklin, Barshay and others. Franklin’s result that production of charged pions is 
higher than that of neutral pions in the cut-off theory is confirmed. The present calculation predicts 
the charged pion production of about 7 mb at 500 Mev against the available experimental cross 


section of a maximum of 10 mb at the same energy. 


$1. Introduction 


In view of the recent experimental evidences!) of the production of one additional 
pion in pion-nucleon scattering at low energy, the problem of double scattering of 7-mesons 
by nucleons has of late been approached by Barshay” and Franklin®. They have made use 
of the extended fixed source meson theory of Chew” et al, and based their calculations on 
a new method for meson-nucleon scattering developed by Wick” which involves the study 
of the eigenstates of the meson-nucleon Hamiltonian. Fran'clin has calculated the transition 
matrix from one meson state to two meson state using two meson eigenstate of the total 
meson-nucleon Hamiltonian analogous to ordinary meson-nucleon scattering, which needs the 
one meson eigenstate to describe the transition matrix of the scattering process. Barshay 
also, using a similar procedure (as studied by Low”), has obtained results which are quite 
different from those obtained by Franklin. The discrepancy between the results of Franklin 
and Barshay can be seen by comparing Franklin’s equation (27) with Barshay’s equation 
(20) for the production T-matrix. It is noted that apart from a factor 1/, ‘2 (due to 
different procedure of normalization) Barshay has obtained only twice the first term of 
(27), the second term of (27), however, vanishes for the 3/2 isotopic spin state and 
the rest of the discrepancy as has been pointed out by Franklin is due to the fact that 
Barshay has used different approximation to identify the certain parts of the production 


T-matrix with the elastic scattering T-matrix. 
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In this work we have adopted the Born-approximation method and accordingly started 
from the lowest order Feynman’s diagram, for the production of mesons in a 
interaction. To obtain higher order corrections to the lowest order diagrams, Dyson's 
procedure of modification of nucleon lines and vertex operators are used. However, one 
must be a little careful in estimating the order of magnitudes of the successive terms of 
the Born-approximation. The validity of this perturbation expansion was checked by 
Friedman” and Kundu” in a different problem, namely the anomalous magnetic moment 
of the nucleon. Again in the present problem it is found that the 5th order correction to 
the production cross section is small compared with the 3rd order one and probably the 
higher order corrections are quite negligible. It must be mentioned that though the 
unitarity of the S-matrix is lost in such an expansion but within the limit of the static- 
source approximation the theory does not predict the cross-section which is beyond the 
geometrical one. As is apparent that the number of Feynman diagrams will increase 
enormously while calculating the 5th order correction to the lowest order Feynman diagrams 
of the production process, we simplify our calculations by restricting ourselves to a Tamm- 
Dancoff type of approximation”. If, in any diagram, a line drawn perpendicular to the 
trajectory of the nucleon cuts more than three mesons (real and virtual) at an instant 
of time, we neglect the diagram. 

In the present calculation we have determined the production of an additional pion 
(z* and z°) in the pion-nucleon scattering, at the energy range 260 Mev to 570 Mev. 
The theoretically calculated values of the pion production cross section in milli-barns at 
different energies is given in tabular form in the last section, and the same was compared 
over with those of Franklin and Barshay. Rodberg’” and Kazes'', using a ‘ one-meson 
approximation ’, have obtained an integral equation of Low.** Some discrepancies in the 
results of Barshay and Franklin have been cleared. Their results are in conformity with 
our calculations based on pure perturbation analysis. 

It is worth-noting’, in this connection, that this problem of double scattering was 
also treated in the relativistic formalism with the use of both the covariant way of Feyn- 
man as well as in the conventional perturbation theory. But this calculations were restrict- 
ed in the ps-ps theory only; and, as is well known, the production cross-section was 


much too small to predict the experimental results, even with a high value of the coupl- 
ing constant. 


§ 2. Matrix element for the production process 


Following Chew, we write the lowest order matrix element M for the meson produc- 
tion corresponding to Fig. I, as follows : 


M(p*, q*, k) =M,(p*, q*, k) +M,(p*, gk) +M.(p*, 9%, &) 


+ similar terms interchanging p and g.--------- (1) 


** The author is grateful to Professor G. F. Chew for informing him the results of these important 
papers and also for valuable suggestions which led to the material development of this paper. 
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Fig. I. Feynman diagrams in the first Born-approximation: Other three digrams are obtained by 
interchanging p and g. (All T should be replaced by r.) 


with 
M,=V,* S(E—o,)V,* S(E—o,—0,) Vi 
M,=V * S(E—a,)V; S(E— 0, + 0) V3* 
and M,.=V, S(E+a,)V,* S(E+a,—,) V,*. 


In writing down the above matrix elements we have used the interaction Hamiltonian 

H,=>)(a, V,.t+a,* V,*), where a,’s are the usual creation and annihilation operators 
k 

V,=if (47)'" pe" (2m,) '? (o-k) Ty (k), and S(E) denotes as usual the nucleon propagator 

in the sense of Chew. Here-in-after the argument in the parenthesis will be dropped for 

brevity. 

To obtain the fifth order correction term we apply the usual renormalization procedure 
of Dyson as formulated by Chew for the static-source theory. We denote this contribu- 
tion by M, etc., where 

MO =V,O4M,0+4M 
+terms interchanging p and q.:-:---+:: (ae) 
The M,{’s are given by 
MOSS. Baa )V i et SLE ors 0g) V 


where 
: fi if 1 Ls 
V,*=V,"| 1+1/9 aaa oe ——-— | 
4 fi f gr ie ( 2 W7 Ew; Tae 
1 1 Ea 
ee S3\(E) =S(E) VV +=+—) 
Z Ew, (Oy oO; 


are renormalized vertex and nucleon propagators with w, being the virtual meson energy 
and E,, E, are the nucleon energies before and after the vertex. 
The net contribution up to the fifth order in the coupling constant from above is 
given by 
M+M 


where the prime in the second term denotes that only the 5th order contributions from 


M® has been taken. 
In addition, we are left with irreducible diagrams, a sample diagram of which has shown 
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Fig. Il. M,° 
in Fig. II. 
M®=M°+4+M,©+M, +terms interchanging p and g., ---+0r0e00t00 (3) 


where M,®’s are the corresponding contributions from the 5th order irreducible diagrams 
obtained from the diagram of the type ‘a’. It should be pointed out that the part M© does 
not contribute to M” because of our assumption that diagrams in which there are more 
than three mesons present at an instant of time are to be neglected. We have for MS 


MOS SCV G Vag Vala Visti le Cota ten te el ate 
7 
+similar terms interchanging p and gq. 


and M,” is obtained by using Goldberger’s crossing symmetry property. f, f,, etc., are 
functions of w arising from the energy denominator of the nucleon propagator. 

Thus the total 5th order contribution to the matrix element T(p*, g*, &) for the 
production of two mesons of momenta p and q from an incident meson of momentum k 
is thus given by 


T (p*, q*, k) =M4+M'O4M®, 


Collecting all the contributions from (1), (2) and (3), we can write T(p*, g*,k) in 
the following simple form 


3 apm N*o* N*).* Nok : ae 
T (p*, q*, k) = ae (47) 92 ue A (p) Nt (q) Nv(®) [> PF 2y pag (@) Fama a | (4) 
“e V 20, V2@,. V 2m, ~ 


where >°" denotes the summation of all terms obtained from the all possible permutations 
of the indices of ¢ and 0; T,y, stands for 7, 7, 7, and o,,, for (o-p) (o-k) (o-q). 
Again Fyuy.px,() are functions of the energy variables w,, @,, w, and also of the meson 
mass /4. Of the total number of terms in the expression (4) half of the terms are 
obtained by interchanging p and qg. The detailed expressions for Fyy, p(w) will be given 
later for the particular case of production of two mesons in isotopic spin-state T=2 and 
T=1 from the initial meson-nucleon T=3/2 and J=3/2. 

It may be pointed out that due to our method of approximation we have vanishing 
contribution from the energy co-efficients Fuav,rop(@), Furviorp(), Fuvazoo(w) and 
PVA phy) appearing in (4) together with another four terms obtained respectively by 
the interchange of indices 4 of the first two and jy of the last two co-efficients. 

In the following we shall be interested only in the specific determination of the <* 
and =" production from the meson-nucleon interaction in the isotopic spin-state, T,,. We 
note here that due to the conservation of isotopic spin the matrix element M should 


——o— 


~ 


a 


Oe 
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‘satisfy 


Gist. 


M Gls T3) ee Te) =M CLs fey) On Ongret 


where the initial and final states are characterised by the quantum numbers T, T, and 


Tt 


First, we calculate the additional production of 7*-meson in the reaction P+7*—> 
N+7a*+27* and we abbreviate the corresponding matrix element by [ 


(Tye RG ret Re k) |T 33) (5) 


where TJ denotes the total meson-nucleon iso-spin states and I denotes the total iso-spin 


-states of the two outgoing mesons. T, and J, are the third component of T and I 


respectively. Similarly the matrix element 


CL ald CB qe, ee, ; (6) 


‘denotes the production of 7° in P+7* scattering. 


For the process given in (5) we have from (4) the following expressions : 
Sy Sorelle). (7) 


-and for the process given in (6) 


le) (8) 
where 
Pook Le (Fyyv,p2 +P vr, var) i DA Enupatiel We pase aie!) 
| Rees fine (Fius,ste) ye Ceres = een ae re) 
dB | Be sit CE oad ip) V2 (FP ivy ane ce Cop oke) 
Pas ) a sae ay é | CPx aeceey) J 2 Oye eed Sh its rig a) 2 
i sree | ane | (Paice) V2 (= Fyiyjtopt Pvaukew) 
\ Het ee) (Eo donee Fuge cen) V2 CEy cana E targoreiey iu, nk) 


‘The values of the matrix element appearing in the right-hand side of equation (9) are 
given in the Appendix. We note here that in the production of 27* and a‘x° from 
Px* system the former is symmetric with respect to the interchange of the isotopic varia- 
bles of the two outgoing mesons while the latter one is antisymmetric. 

In deducing (7) and (8) we have replaced the z-matrices by their eigenvalues for 
the respective transitions from the given initial isotopic spin states to the final isotopic 
spin states as denoted in the expressions (5) and (6). This replacement of the t-matrices 
by their proper-value can be effected by noting that one can use 7,, ¢_ and 7, instead 
of 7, T t3(zp, etc). This requires only a slight modification that since T,, T_ are not 


“129 


hermitian one should be careful about the ‘ star > in the emission. One should remember 


also that a 
oP DN, tN 2 ee cat a, 


¢,0P SP N= SN; 
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where N and P denote the neutron and proton states of the nucleon respectively. 
Following this, one can at case construct a table of the eigenvalues of different com- 
binations of the three c-matrices (ie. of T,, Ty, Ty, etc.) from different transitions such 
as P+at—y»N+7++7"*, etc., just similarly as obtained by Wick (cf. Table 1, page 348 
ref. no. 5) for the combination of two 7-matrices in transitions P+z*—P47"-, etc. 
We now expand the scattering matrix T(p*, q*, &) in terms of the projection 
operators for the various states of the total angular momentum. We keep here only the 
angular momentum states characterised by J=3/2. Using the well-known projection 
operators P, for the state J=3/2, we can write the T-matrix for the production of z~ 


from equation (6) in the following way, 
Sie: lnalt (Ps Gr, eee ea 
= (1/3)[ (op) (pu. 2 (kg) +io-(kXq) ]} + Wegpl 2 (bg) tio: (kX) (o-P)] 
+ terms interchanging p and q. (10) 


The result for the square of the matrix element averaged and summed over initial and 
final spin states is given by 


if )p'g?k’ (a, +6, cos?) = (11) 


1 1( 4z 


IT. (p%, 9%, BP=| Laz) 
L 


8M, Wg Wx 
where T+ (p*, q*, k) stands for the T-matrix element appearing in the left-hand of the 
above equation (10). 

In a similar way the square of the matrix element, T..(p*, g*, &) for the produc- 
tion of 7° is given by 


|Tao(p*, g*, 2) = ee : = i = \p'gk (ap +by cos *#) (12) 
where 
4; {2 (1 pg +L egy) + Donp— Crepe) $2 + 4 ¢ A A) 
+ Lip Lop) P+ LD ae +2 apg) — Pore +P vgn) }? 
and by == 3 {2 (Pap + Pop) + Tore Page) }?= { Popp + Pape) — Pee + , aed 


+21 {2 Doig + Page) + (Coro Papp) } {2 Doan + Dine) + Leg — Cree) } (13) 
+12 (P yap +L apy) + Lng — Crap) } { Prego — Core) + Pie — Prony) } 
+1 ep —L pq) + Pape — Pore) } {2D ptg + Pregw) + Corp — Pepe) }) - 
a, and 6, are obtained from the same expressions for 4, and 6, respectively given above 
where now all the /’’s should be replaced by their primes. 
§ 3. Cross section 


| The differential cross section for the production of an additional pion in the scatter- 
ing of a pion of momentum k into two types of pions of momenta p and q is given by 
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do =2T/v Ds] Pr\t (p*, q*, k) 


where v=relative velocity between the incident meson and nucleon and in the static limit 


2 0(E,—E,) (14) 


v=k/a, and py is the number of states per unit energy level. Remembering the conserva- 
tion of energy, we get in our approximation w,=w,+w,. In (14) we will use the 
value of T+(p*, q*, k) and Tyo(p*, g*, k) for T(p*, q*, k) for the production cross 
section of 7* and 7° respectively. Using the values of (11) and (12) together with 
(13), we get from (14) that the total cross section o for the production of 7* and 7° 
in P+a*—>N+7*+427* and P+7*->P+7*+7°, respectively, is given by 


ae 


1 e 9 9 2 { 9 9\ 3/2 9 9793/2 
pe eed — (we —/£) =| dQ | (erg? — 2)? (wp— Wp)? — 12? (a, +8; cos? 9) deny 
Tomy fi u Ci5)) 


where i=1 denotes the production of 7° and i=2 for x’ production. 


§ 4. Results and discussions 


We now give the numerical results for the total cross section o, determined from 
the formula (15). The results are tabulated for incident meson energies (laboratory 
system) from 260-570 Mev and compared with those of Franklin and Barshay. In 


obtaining the results two adjustable parameters, the coupling constant f° and the cut-off 


Table 1. 


Cross section in mb. 


= 
| Incident meson | 
Processes | 


gaeteynin Dev. | Present Calculation | Results of Franklin | Results of Barshay 
260 0.067 0.040 | ~0.071 
P+ntoN+att+at 360 0.889 | 0.970 | ~0.860 
570 | 15.640 | 9.250 
260 | 0.037 0.037 ~0.440 
Ptnt—>Pt+nt+r 360 0.484 1.040 | ~0.550 
570 8.017 6.540 


energy «, are chosen to be .093 and 6/: respectively. It appears from the table below 
that the charged pion production is larger than the neutral pion production, contrary to 


what has been obtained by Barshay. The only experimental data of Blau and Caulton" 


on the charged pion production cross section by free nucleon at incident 500 Mev meson 


energy give a rough estimate of 3.5 to 10 mb. 
using the above values of the coupling constent and the cut-off energy the production 
cross section of 7 mb at 500 Mev meson energy. This is fairly in good agreement with 
their estimate. Calculations at still higher energy such as 570 Mev predicts a value of 
15 mb, of the production cross section indicating a complete breakdown of the cut-off 


theory and in fact this result exceeds the geometrical cross section (74°). However, it 


In the present calculation we find on 
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must be pointed out that the energy for which the cross section goes beyond the geo- 
metrical one is too high to expect any fruitful result with the static-source theory. 
Another point must be borne in mind that assumed perturbation expansion destroys the 
unitarity of the S-matrix leading to such catastrophy. But in support of the validity of 
our calculation we note that the 5th order terms are small compared with the 3rd order 
ones and probably the 7th order terms are still smaller than the 5th. It may be remark- 
ed that the present procedure is not exact. To discuss fully one should use the dispers- 
ion relation for pion-nucleon scattering in a similar way as introduced by Miyazawa’ in 
the investigation of nuclear forces and magnetic moment problems. 
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© ail 7 
Puy apg = 204 We 0, 


Fyuy pig = — Up W7_ — Of Wp { (Oo +4) — 707 +4 (w.— vg) 7} + oO, 
X {7 (w+ wp) *— we" | 

Fy uy, pqk = — 204 Wp '| (Wo— wg) ~ + (we + W) | 
Puy, qph= — 20, we (Wp +9) (A) 
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Pay yopk= — 407 Oe OK 


where w,= Oe (e— #) fas ie. 
67 


In above we have written in detail only half of the F,---’s appearing in the right-hand 
matrix of equation (9); the rest of the functions can be written very easily by inter- 
changing the suffixes p and q and as well as 2 and v. It is interesting to note that one 


needs only to interchange the suffixes p and q in the right-hand side of the expression 
for F,+-. in equation (A). 
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A general formulation is given to the quantum theory of steady diffusion. In seeking for a 
steady solution of Liouville’s equation, the boundary condition is taken into account by requiring that 
the solution should lead to a given distribution of average density. The distribution is to be determin- 
ed by macroscopic law of diffusion and macroscopic boundary condition. 

The basic equation thus obtained has a form similar to Bloch’s kinetic equation and reduces to 
the latter in the limit of a system of weakly interacting particles. This is shown by generalizing a 
damping theoretical expansion of Kohn and Luttinger. 

It is found that the Einstein relation is valid only for the symmetric part of diffusion- and electric 
conductivity tensors, in agreement with Kasuya’s suggestion. 


$1. Introduction 


‘ Recently Kubo and others’”” have succeeded in formulating quantum statistical ex- 
pressions for transport coefficients such as electric and thermal conductivities. These 
formulae are just as general and rigorous as, say, the familiar expression for the partition 
function Z=Tr(exp{—H/kT}). In practice appropriate approximation should, of course, 
be made in evaluating transport coefficients. The point is, however, that the conventional 
Bloch equation is nothing else but the lowest order approximation in a damping theoretical 
treatment of dynamical motion (see § 6) and by no means the most general way of ap- 
proaching the problem. 

Now, in the case of mechanical disturbances such as an external electric field, deri- 
vation of these formulae has been rather simple. The mechanical disturbance is expressed 
as a definite perturbing Hamiltonian and the deviation from equilibrium caused by it can 
be obtained by perturbation theory. On the other hand, thermal disturbances such as 
density and temperature gradients cannot be expressed as a perturbing Hamiltonian in an 
unambiguous way. Accordingly in the previous paper,” use had to be made of Onsager’s 
assumption that the average regression of spontaneous fluctuation follows the macroscopic 
laws. As a result, certain ambiguity has been left over, concerning galvano-magnetic 
effects caused by thermal disturbances. 

In the present paper, a general formulation free from such an assumption will be 
given to the theory of thermal disturbances and, in order to show the basic idea, the case 
of steady diffusion will be discussed in detail. In this case, the usual Bloch equation 
takes the form in which the drift term due to the density gradient is balanced by the 


PS A SA A Sg 
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collision term. Our purpose is then to generalize the equation so as to include systems 
of strongly interacting particles (electrons, phonons, impurities, etc.). In other words, we 
shall seek for a steady solution of the quantum theoretical Liouville equation 


i|H, p|=0, (hel) 


where H is the Hamiltonian of our system and y the density matrix, taking h=1. The 
boundary condition should be such that a steady gradient of average density is established 
by attaching suitable sources at the boundary. But for such a boundary condition, (1-1) 
would lead to an equilibrium density matrix, microcanonical, canonical or grand canonical. 
The microscopic detail of the interaction with sources at the boundary, however, should 
not be essential to the law of diffusion as an intrinsic property of a large system. We 
need only to suppose that the solution of (1-1) should lead to a given distribution of 
average density. The distribution is in turn to be determined by the macroscopic law 
of diffusion together with a given macroscopic boundary condition. | The law of diffusion 
itself is a consequence from our solution of (1-1), so that the method is self-consistent. 

In this sense, the thermal disturbance is a constraint upon dynamical motion and in 
fact appears as an effective potential in our basic equation (§ 4). The equation takes a 
form similar to the Bloch equation and reduces to the latter in the limit of a system of 
weakly interacting particles. This will be shown by applying a damping theoretical ex- 
pansion due to Kohn and Luttinger” (§ 6). 

An important consequence of the theory is that the well-known Einstein relation is 
valid only for the symmetric part of diffusion- and electric conductivity tensors. In other 
words, as regards the Hall effect, the gradient of chemical potential is not equivalent to 
the electric field. The difference becomes appreciable at low temperatures and under 
strong magnetic fields. Such a difference has first been suggested by Kasuya and is con- 
firmed far beyond doubt by the present theory (§ 5). 

From technical points of view, the present theory is similar to the so-called method 
of pseudo-potential.” It may also be regarded as generalization of Enskog’s classical kinetic 
theory.” Indeed, Matsubara” has once tried the theory of hydrodynamic properties from 
the latter point of view. It seems, however, that neither physical nor mathematical details 


has ever been examined by these previous authors. 


§ 2. Leeal equilibrium 


For definiteness, let us take a system of similar particles (say, electrons in a metal), 
in which there exists a steady gradient of average density. The number density of 


particles is represented by the operator 


a(x) = SO (nx— x)= 2S in, Oa) 
a ‘ 


where 2 is the volume of the system and m,=>}exp{—ik-x,} should not be confused 


with occupation numbers. The average density is then given by 


Tr (¢n(%) ) =(n(x) ). (2-2) 
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As mentioned in § 1, we suppose that the right-hand side of (2-2) is given asa solution 
of the macroscopic diffusion equation, hence (2-2) is the condition for the solution of 


(1-1) to satisfy. 
Let us describe the system in equilibrium by the grand canonical distribution 


pe=exp {¢N—H} /Tr (exp {¢N—fH}), (2-3) 


where ¢ is the chemical potential, 7 the temperature and N the operator representing the 
total number of particles and commutable with the Hamiltonian H. We shall assume 
that the system in equilibrium is homogeneous so that 


(n(x) )e=Tr(p.n(x)) =n (2-4) 


is constant. Hence (2-3), though stationary, does not satisfy (2-2) when the density 


gradient does exist. 
A possibility of satisfying (2-2) is given by the so-called local equilibrium distribu- 


tion 


Pr =exp{(¢+¢N—SH+ S¥s;,n_,}- (2-5) 


P=) 
Here ¢ is a normalization constant to make 
Tr(¢,)=1 (2-6) 


and the ¢;,(k+< 0) represent the fluctuation of chemical potential in space. From (2-6) 
it can be easily seen that 


(n_p) = — O¢, Ox; F_,.= — dS, 0 <n,» (2-7) 
where the entropy S is defined as 
S=—Tr(e, loge). (2-8) 


Obviously, (2-2) is always satisfied by (2-5) with suitably chosen €,. But (2-5) 
does not satisfy (1-1), [H, @|0. 
We are thus led to assume the density matrix in the form 


P= +, i (2 : 9) 


Actually, however, no approximation is introduced in so far as writing the density matrix 
in this form. For any given , we can always find (2-5) with parameters ¢, €, 3 and 
¢, so chosen that 


Tr(¢) =Tr(@,), Tr(oN) =Tr(e,N) 
(2-10) 
Tr(oH) =Tr (9H), Tr (ony) =Tr (eo, n,) . 


Then (2-9) can be regarded as the definition of 1, which should necessarily satisfy 
Tr(¢,) =0, Tr(p,N) =0, 


2-11 
Tr(pyBl 0, element, ek 
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There are an infinite number of density matrices which satisfy (2-10). Among 
them, (2-5) is characterized by the maximum entropy and makes it possible to introduce 
thermodynamical variables as shown by (2-7). This is the reason for our particular 
choice (2-5), because the boundary condition of our problem is most simply expressed in 
terms of thermodynamical variables by specifying chemical potentials of sources at the 


boundary. 


§ 3. Linear approximation 


Now let us assume that the system is not far from equilibrium so that both ¢;, and 
?, are small quantities of first order. Then the well-known expansion formula of ordered 
exponential leads to 


A He . 
p= pe(1+ S14 {Fn 4(=i0)) G-1) 


0 
where we have made use of (2-4), i.e., 
(nm) =, k == 0 (3 é 2) 
and m,(—iA) is obtained from Heisenberg’s operator m;,(t) =exp(iHt) -n,,-exp(—iHt) by 
replacing ¢ with —i/. 
Under the linear approximation (3-1), the average density is given as 
(ny) = Tt (Ore) =I E (3-3) 
where 
ey 
y) - 
we < tu (—id) me: (3-4) 
out 


In particular, assuming 7, is continuous at k=0, we can show that 


lim 9, = 2 (0n/O¢) (3-5) 
k>0 


where n is defined by (2:4). From (2-7), the entropy is expressed as 
S=S,—4 >) Die. (ny) (n_x.) (3-6) 


with the equilibrium value 5). If the density fluctuation is gradual, i.e., only the Sy. 
with small k are excited, we may replace gy, in (3-6) by (3-5), so that we obtain the 
well-known expansion of entropy in thermodynamics. 

Let us now turn to the flow density which is defined by 


v(x) =$>) (x0 (x—x,) +0 (x—x,) x;) = 2 > 0, e*, (3:7) 


In cases of neutral particles and also of charged particles without magnetic fields, the 
local equilibrium (3-1) makes no contribution to the average flow. In the case of charged 


particles moving in a magnetic field, on the contrary, the flow does not vanish even in 
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equilibrium, because there exists a diamagnetic current. This current, however, can be 
described in terms of a magnetization and, in particular, never gives rise to a net current 
flowing through a cross section of a conductor (except for superconductors). So we here- 
after disregard this part of flow. 
There still remains the flow due to the second term of (3-1) 
(i.),—= Se sf (n_;,(—i/) Op )e- (3-8) 


g 


* 
0 


This vanishes if there is no magnetic field. Because then », is invariant and m;, changes 
its sign under the transformation of time reversal. In the presence of a magnetic field, 
however, the transformation of time reversal includes the reversal of the magnetic field, 
too. So we can infer only that (3-8) is reversed together with the magnetic field. In 
other words, the flow is antisymmetric with respect to the magnetic field. But it does 
vanish in the classical limit and therefore in the high temperature limit, too. In fact, 
in the classical limit, we can first perform the integration over particle velocities in taking 
the expectation value in (3-8) and this vanishes because it is linear in velocities. 


A more detailed discussion of (3-8) will be given in § 5. 


§ 4. Basie equation and its solution 


Now inserting (2-9) together with (3-1) into (1-1), we obtain our basic equation 


the ee ae dt A 
iL H, aj=- =x | 3 en_»(—i4) (4-1) 


where, of course, 


ny, =i|H, n,]=—ik-d,. (4-2) 


A particular solution of the inhomogeneous equation (4-1) is given by 

4 = | ae e7*! ett Peat (4-3) 
where /’ stands for the right-hand side of (4-1). Im fact, (4-3) is the solution which 
we are seeking for. This can be seen in the following way. 

First, note that the macroscopic relaxation time, i.e., the relaxation time in which 
the system recovers equilibrium, is of the order of /?/D. Here 1 is the linear dimension 
of the system and D the diffusion coefficient. Now suppose that the system was in 
equilibrium at the remote past, t=— oo, and that the difference of chemical potentials 
at two ends of the system has been increased very slowly from zero to the present value 
(t=0). For instance, suppose that it is proportional to exp(&t), where 0<€<D/P. 
Then we may assume that a steady diffusion is established at each instant of time from 
f=—co to t=0. This is a sort of adiabatic change and may be represented by the 


a 
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solution proportional to exp(€&¢) of the following Liouville equation 

0, +i[H, pesenl 

0,(— co) =0, €5+0. 
At t=0, we have 

éo,+i(H, o,J=l (4x4) 
whose solution is nothing but (4-3), as can be easily confirmed. 


It still remains to be confirmed that (4-3) does satisfy our subsidiary conditions 


(2-11). For any dynamical variable a, we have 
SV Ee fc —et ( dh 3 v7 
Tr(, a) =—SVe,| dre | 2 (nn ( a). (4-5) 
0 oi! 


If we take a=1, the left-hand side is Tr(,), and the right-hand side contains (ieee 
This should vanish, because any flow in equilibrium is steady. The first condition in 
(2-11) is thus satisfied. As for the second condition, we take a=N. Since N com- 
mutes with the Hamiltonian, the right-hand side of (4-5) contains (n_,N ecm Lnis 


should also vanish, because 


ees © casgeiNy) = 


OTe (et n_,,) =0. 
Thus the second condition is satisfied. Similarly, it can be seen by taking a=H that 
the third condition is also satisfied. 

As for the final condition in (2-11), we have 


ete ietads : 
Tr(e,n.) = —iks,,- \ dite | i {pix (— iA) ny. (t) ye (4-6) 

0 0 : 
where we have made use of the continuity equation (4-2). In general, (4-6) does not 
vanish. We should, therefore, restrict ourselves to the case where the density fluctuation 
is so gradual that we may replace (5, in (4:6) by (VN)... Here V is the net flow, 


i.e., v, with k=0. Since there is no net flow in equilibrium as mentioned before, we 


have 


Tr(ett* VN) == Tr(e* FV) (0), 


Thus, under our restriction, the fourth condition in (2-11) is satisfied. 


The restriction can more precisely be expressed in the following way. In terms of 


space coordinates, (4-6) is written as 


i 


Tr(p,n(x)) =— \ dx! R(x—x’) VF (x') (4-7) 


with 
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r di . ¢ 
R(x—x!) =| dee| = (p(x’, —is)n(x, t))e- 


} 
‘ 


0 0 


Except for such singular systems as superconductors and superfluids, the relaxation function 
R(x) will decay off within a finite distance. The gradient of chemical potential ['s 
should be nearly constant over this distance. This is the precise formulation of our 
restriction. If the condition is not satisfied, the simple themodynamical notion of diffu- 


sion is not applicable in describing the phenomenon. 


§5. Diffusion constant and Einstein’s relation 


Now, we take @=w;, in (4-5), then the flow arising from , is given as 


o i 


eae 
0 3 


3 


(oy, = Tr (P,P%) = —F | dt e~* (n_,(—id) v(t) >. - (5-1) 


0 0 


Making use of the continuity equation (4-2), we see that in the limit of k—>0 the 
diffusion equation takes the form 


= 1 pe 
(Unu)1 = — > a (ik, <3), (5-2) 
BS v=x, Y> z- 
In terms of space coordinates, this is written as 


7 a= 
(.(2)=— Spe =) , (5-3) 


Here the diffusion tensor D,, is given by 


ao 


e 


af a ; 
DY =| dre |S v.00). (5-4) 


where V is the net flow. 


On the other hand, it has been found’ that the electric conductivity tensor is given, 
in general, by 


= a. 
Cy = | dt em“ \ da Ji) Ju (Ye (5-5) 


where J=eV is the electric current and e the charge of the particle. Comparing (5-4) 
with (5-5), we obtain the well-known Einstein relation 


Cw =PeDS. (5-6) 


It should be remembered, however, that in the presence of a magnetic field we have the 
flow arising from the local equilibrium distribution, (3-8), which is antisymmetric and 


thus makes contribution to the antisymmetric part of the diffusion tensor. Thus we 
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obtain the important conclusion that the Einstein relation is valid only for the symmetric 
part of diffusion and electric conductivity tensors. 

In order to see the situation in more detail, let us take a system of electrons in a 
metal. For simplicity we shall neglect the interaction with phonons, taking account of 


impurity scattering alone. The Hamiltonian then takes the form 


H=}H'(x)), 


H! (x) =7_-(p-+-A(x) ) +9(#), 


where A is the vector potential of the magnetic field and ¢ the scalar potential including 
both periodic and impurity potentials. Hereafter an operator of one electron will be 


primed as H’. Let us introduce the system of one electron eigenfunctions 

je ch i aa 5a ame 
Then the Hamiltonian can be written as 

HSE /4,*%. (5-7) 
where 4,, 4,* are destruction and creation operators of electrons in the r-state. Similarly 


ny, = Sree, dae a;, 


(5-8) 
p= > rl, ea, ade, 
where 
ny! ae eee Dx! ==} (x eile 1 eile x) : 
x=i[H’, x]. (5-9) 
Remember that 
eiitt g * q e tq de 
and also that 
(a, dy Apy * dey ye =Oes1 Dypt TEs) (1—f(E,)) 
where f is the Fermi distribution function 
f(E) =[exp (PE—£) +1]”. (aa20) 
Then it is easily seen that (5-1) can be written as 
: P 
ee f(E,.) —f(E,) ' of |r0 E.—E,) + : (5 . 1) 
(We) = — Fhe Ds B(E,—E,) (s\n lr) (r| wy, |5) 7O(E, Py) eee 


where P indicates taking the principal value. The flow arising from this part is anti- 


oe . . 3 s 
symmetric, whereas the flow arising from the 0-function is symmetric. 
Similarly, (3-8) can be written as 


Cor) iy pal oa ose (s\n!_;,|7) (r| v4! |5)- (5-12) 
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For the sum of those terms in which E,*E,, we can make use of 
(s\n!_,,|r) =i(E,—E,) (5|_x\7) (5-13) 


so that the sum exactly cancels the antisymmetric part of (5-11). On the other hand, 
the terms with E,=E, in (5-12) can be transformed as 


: : ew sl OEE Y 
Tapa . Pie (s| E (np, op + Dx! 2_p) |S) —Fke pap oS f 
laa 8 JE = E,+F, fy) JE 
x4 {ds|n_p|r> Clog! |5) + (5|0,’ |r) (r]n_a|5)}- (5-14) 


From (5-9) we see 
fg ’ Z 
4 (n'y, Dy +0; nj.) =X 
so that the first term of (5-14) vanishes. For the second term, we can again make 


use of (5-13). Taking the limit of k—>0 we find the antisymmetric part of the diffu- 
sion tensor 


he ea Eames Bi 
Siew DE Mi EE) 


A wr ee y fly Ny eee ; 
1p ae A {(s|xylr) Cr|xy|5>— (s|xylr) Cr]ay|spfp- (5-15) 


On the other hand, the antisymmetric pait of the electric conductivity tensor is given as 


1 ot) = yr FE.) —f(Es) J my Xa {dslx,|r) Crlx,|5)>— Cslx, |r) Cr]x,|s>}. 
ee. BE HE) hig Bye ee “aes 


(5-16) 
Now, characteristic frequencies of the electron velocity (s|x/r)exp{i(E,—E,)t} are 
—I1 


cyclotron frequency «,, collision frequency 7~', and also the average interval of interband 


transitions JE. If all these frequencies satisfy 


Bare < 1 3 


Bra' <1, BdEA1 (5-17) 


then the difference quotient of f in (5-16) is practically the same as the differential 
quotient in (5-15), and we have the Einstein relation for the antisymmetric part, too. 
It is to be noted here that the second of (5-17) is the well-known criterion for ap- 
plicability of Bloch’s kinetic equation, although actually this is too stringent. 


$6. Damping theoretical expansion 


Finally, we shall show that our basic equation (4-1) reduces to the usual Bloch 
equation in the limit of a system of weakly interacting particles. In fact (4-1) has 
mathematically the same form as discussed by Kohn and Luttinger” in the case of a 
steady electric field. They have dealt with a simple model in which an electron is 
scattered by impurities. Actually their method can be applied to integration of the 
Liouville equation in general, provided that the Hamiltonian satisfies certain conditions. 

First let us introduce a symbolic method due to Kubo. Define the linear operator 
£ which operates always from the left on any dynamical variable 7 as 


= 


— oo 
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49 =i[H, 7]. (6-1) 


Incidentally, the operator of this sort has been introduced in classical statistical mechanics 
to prove the ergodic theorem.” The Liouville equation can be written as 


(<-+£)o@ =0. (6-2) 


Let us introduce the Laplace transform 


p(s) = | dt e=™ o(t) (6-3) 


which converges for ts>0. Then (6-2) is transformed into 
(sr £) p(s) =o (6-4) 


where , is the initial density matrix at t=0. Replacing o(s) by ¢,, ~ by I’, and 
taking the limit s—>+0, we obtain our basic equation (4-4). But we shall be concerned 
with the general case (6-4) for the moment. 


Now, assume that the Hamiltonian takes the form 
H=H,+9h; (6-5) 


where g is a small numerical parameter indicating the order of perturbation H;. Cor- 


respondingly the operator £ splits as 
ep etagei id aot (6-6) 


Let us fix the basic state vectors |a’) as 


Fi y|\ a =o 2 


In the case of the electron-phonon interaction, for instance, a stands for a set Of occupa- 
tion numbers of free electrons and phonons. We introduce linear operators ) and 7 


which also operate from the left on any dynamical variable as 
Ka| Dyla’) =(aly|a) dee, 
Call (7 a") = (a\g| a) (10a 


Without loss of generality we assume that H,=0. Following Kohn and Luttinger, we 


decompose the density matrix into diagonal and non-diagonal parts 
1(s) =Pa(s) + Pn(s)s 
i Va | (6 ; 7) 
fa= Do, pr= VAG : 


Inserting (6-7) together with (6-6) into (6-4) and taking diagonal and non-diagonal 


parts of the equation respectively, we obtain 
= Ty) 
SPa (s) +9 L1Pn (s) = Dr, 


(6-8) 
(s+ Ly +97 41) fn (5) == Hy —IL1 0a) 2 


958 S. Nakajima 


Eliminating , from these, we have a formal solution for the diagonal part 


1 1 ie 
—¢ ee Sa oe) Se Dae #0, J... (6-9) 
50a (5) PDL, +L + gL: Li f'a(s) ( s+ L.4+9N Li ) 
We can derive a power series expansion from (6-9), supposing that 
Pals) =O(9~) (6-10) 


and making use of 


1 1 1 a4 
= = - gM £1 ee eS Othe (6-11) 
S++ Lot 9 Gy Re te a st+Lo 
In particular the lowest order approximation satisfies 
6) 2 Q) . 1 : @® = ¥ . 
sea (8) —9 DL Lr)? (s) = Dfro- (6-12) 
wae ae 
Remember that 
oe ee | exp (— st) -exp(—iH,¢) -7 exp(#H,t). 
Then, taking the explicit matrix representation of (6-12), we find 
sP,(s) — 9?) + |(a| Hye’) |? (P,! (s) —Pa(s)) =Calapla), 
a $+ (We— Ww)” (6-13) 


P,(s) =Cale® (s) |e). 
In the limit of s—>+0, we have 


= 2s = 
P+ (we—War)? 


_~ 270 (wWe—Wa)- 


Hence (6-13) is nothing else but the Laplace transform of the so-called master equation 


P,(t) = S327 g?|(a|H,|a’)|29 (we—wa) (Par (t) — Pa (t)), 


(6-14) 
P,(t=0) = (al ,|@). 


which van Hove’ has obtained by means of rather a lengthy expansion. 
Now replacing » by ,, by J’, and taking the limit s>+0, we have 


27g? S}|(a|H,|a”)|20(we—war) (Fer — Fy) =a|T"|a’) (6-15) 
ar 


where F, is the diagonal element of 7, in the lowest order approximation. The equation 
(6-15) says that the drift term due to the gradient of chemical potential is balanced by 
the collision term due to perturbation H;. More precisely, F, is still a many-particle 
distribution function and we have further to reduce (6-15) to obtain the Bloch equation 
of one particle distribution function. 


Of course, certain conditions should be satisfied in order that the transport equation 


LL EDA AL A EL IE 


ae 
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(6-12) be already a good approximation. Smallness of the parameter y is by no means 
sufficient. For instance, in the case of an electron scattered by impurities, the contribu- 
tion of the second term in (6-11) relative to the first would not be simply proportional 
to 7, but to gy, if impurities were on regular lattice points. In this case the transport 
equation can never be a good approximation, because g2 can be increased indefinitely by 
increasing the volume 2. In general, the size dependence of perturbation in a large 
system plays an essential role here. We shall not enter into this problem, as the detailed 
analyses have been given by van Hove.” 


§ 7. Conclusions 


Quantum statistical mechanics of steady diffusion has been formulated. The theory 
is the most natural generalization of the Bloch kinetic equation and in fact reduces to 
the latter under certain conditions. An important conclusion is that the Einstein relation 
is valid only for the antisymmetric part of diffusion- and electric conductivity tensors. 

Obviously, the theory can be generalized so as to include heat conduction and viscosity, 
‘which will be discussed in a subsequent paper. 

The expressions for transport coefficients derived in this way are general and rigorous. 
In statistical thermodynamics, we have the general expression for the partition function 
and introduce approximate methods in evaluating this; the virial expansion in the case 
of imperfect gases, normal vibrations in the case of crystals, and so on. In just the same 
way, we should introduce appropriate methods of approximation to evaluate transport 
coefficients, starting with our general expressions. Thus we may couclude that quantum 
theory of transport coefficients now stands on the same level as statistical mechanics of 
equilibrium properties, though admittedly we know few methods of approximation such as 
Bloch’s kinetic equation which is nothing but the lowest order approximation in a damp- 


ing theoretical expansion. 


References 


1) H. Nakano, Prog. Theor. Phys. 15 (1956), 77. 
R. Kubo, J. Phys. Soc. Japan 12 (1957), 570. 

2) R. Kubo, M. Yokota, and S. Nakajima, J. Phys. Soc. Japan 12 (1957), 1203. 

3) W. Kohn and J. M. Luttinger, Phys. Rev. 108 (1957), 590. 

4) HB. Callen, M. L. Barasch and J. L. Jackson, Phys. Rev. 88 (1952), 1382. 
R. Kubo, N. Hashitsume and M. Yokota, Busseiron Kenkyu, 91 (1955), 32. 

5) S. Chapman and T. G. Cowling, The Mathematical Theory of Non-Uniform Gases., Cambridge, 
1939. 

6) T. Matsubara, Busseiron Kenkyu, vol. 2 (1957), 190. 

7) EE. Hopf, Ergodentheorie, Berlin, 1937. 

8) L. van Hove, Physica 21 (1955), 517; ibid. 23 (1957), 441. 


960 


Progress of Theoretical Physics, Vol. 20, No. 6, December 1958 


Foundation of Deformed Potential Model for Nuclear Rotation 


Kiyomi IKEDA, Minoru KOBAYASI, Toshio MARUMORI,* 
Sinobu NAGATA and Takanori SHIOZAKI 


Department of Physics, Kyoto University, Kyoto 


(Received August 15, 1958) 


Following the prescription suggested by Bohr and Mottelson, we develop a formulation of the 
separation of the nuclear Hamiltonian into rotational and intrinsic parts. In the light of this formula- 
tion, the foundation of various methods with the deformed potential model for the estimation of the 
moments of inertia for nuclear rotation is discussed and the interrelation among these methods is clarified. 
Treatments are restricted to the two dimensional case for simplicity. 


$1. Introduction 


The fact that the existence of rotational spectra had been confirmed in certain regions 
of elements brought forward the problem of finding the transformation which leads, at 
least approximately, to the separation of the nuclear Hamiltonian into intrinsic and rota- 
tional parts. 

The early workers’) in this field succeeded in indicating the existence of rotational 
states of nuclei qualitatively, but they could say nothing quantitative about the values of 
the moments of inertia associated with these rotational motions. They gave much smaller 
values for the moments of inertia than those empirically determined. It has become clear 
in time this discrepancy originates in the poor separation of the nuclear Hamiltonian. 

On the other hand, various prescriptions for the calculation of the moment of inertia 
without recourse to the coordinate transformation were also proposed by many authors.* —*) 
Among those, the so-called generator coordinates method®* and the cranking model‘ —? 
are most typical. The common characteristic of these methods is that the wave functions 
cotresponding to the independent particle motions in a deformed Hartree field are used 
as the model wave functions to evaluate the moment of inertia. Hence we may call them 
the methods with the deformed potential model. 

Many authors derived different expressions’ for the moment of inertia of the rota- 
ting nucleus by making use of this deformed potential model. But, it was difficult to 
discuss the relative merits of these results, because the physical foundations of such calcula- 
tions were not so clear. Further, as was pointed out by Lipkin,” the question that at 


what stage of the calculation the model wave function is to be adopted has been left 
open. 


Thus, there are two problems in our face: the first problem is to find the compact. 


* The author thanks the Iwanami Fujukai for the financial aid. 
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form of transformation which leads to the optimum separation of the nuclear Hamiltonian 
into rotational and intrinsic parts; the second problem is to show how and at what 
stage of transformation the appropriate model wave functions should be applied in the 
calculation of the moment of inertia of the rotating nucleus. 

As for the first problem, Bohr and Mottelson'” have recently suggested a very in- 
genious prescription for the separation of the nuclear Hamiltonian by introducing a kind 
of canonical transformation which adopts collective orientation angles depending on both 
the positions and the momenta of the nuclear particles. Indeed, the ordinary point trans- 
formations leave so large an amount of coupling terms that they do not even approximately 


separate the Hamiltonian into two parts.”!~"™ 


Further, these authors gave a condition 
for separation of the nuclear Hamiltonian in the form of operator equations, though the 
satisfactory test for the applicability of this condition in the case of real nuclei is impos- 
sible at the present stage of the theory. This problem seems to be closely connected 
with the nature of nuclear forces. That is, the criterion to the separation of the nuclear 
Hamiltonian depends essentially on the interactions between nucleons. 

With regard to the second problem, it must be emphasized that the model Hamil- 
tonian with the deformed potential is not invariant under rotation, so that the model 
wave functions cannot be the eigen-functions of the total angular momentum. This situa- 
tion causes some confusions in the calculation of the nuclear moment of inertia. The 
moment of inertia calculated by this model Hamiltonian does not correspond to that for 
nuclear free rotation, but to that for the spurious oscillation about the equilibrium orienta- 
tion in the deformed potential fixed in the space’. 

In this paper, we shall first, in § 2, develop the method of canonical transformation 
introduced by Bohr and Mottelson'” in the formalism of the redundant variables with the 
subsidiary condition and show on what condition the separation of the real nuclear Hamil- 
tonian is performed. Next, in § 3, the foundation and the validity of various methods 
with the deformed potential model will be discussed from the standpoint of the canonical 
transformation formalism developed in §2. The question raised by Lipkin” will be 
answered by showing the step of the transformation at which the model wave function 
should be applied. Finally, in § 4, we shall discuss the validity of the various expressions 


for the moment of inertia of the rotating nucleus obtained by many authors. 


§2. Separation of the nuclear Hamiltonian into 


rotational and intrinsic parts 


Let us consider such an even-even nucleus but not of a closed-shell configuration that 


exhibits rotational spectra. The real Hamiltonian H, is 


A 
Hea Lis (2'=1) 


i=1 i<j 


which is invariant under the spatial rotation. Here T, is the kinetic energy of the i-th 


nucleon and v,; is the interaction potential between the i-th and the j-th nucleons. 
e . 
For simplicity we restrict ourselves to a two dimensional problem. Since we are 
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interested in the rotational spectra of the form 


ae ee i (2-2) 
27 


our Schrodinger equation may be given in the form 
FA, DBs x4) =Ey Pe Pee X4)5 (2- 3) 


where YJ, is the eigenstate belonging to a particular rotational band «; and having an 


eigenvalue I of the total angular momentum I. 
We shall adopt the method of the canonical transformation with the redundant varia- 


bles by introducing redundant variables 4 and I, which satisfy the commutation relation, 


[I,, 9J=—. (2-4) 
Then, the Schrodinger equation (2.3) becomes 
Hy Fx (%1,°**, Xa, O) =Eg Fey (% 5°“, Sa, O) (2-5a) 
with the subsidiary condition, 
1, Fa (eg +5845 8) =. (2- 5b) 


This subsidiary condition is formally equivalent to the condition 


a ean: oe tM an, SD AB, er 


GB. (a),---, Xa, 8) =0, 


which is commonly used in the redundant variable method, but our condition (2.5b) is 
more physical than the usual one and is easily extended to the three dimensional case.* 
Next, we perform the following canonical transformations successively," 


Ly 8 = (U,U,U,) Pe ys (2-6) 
U,=exp( — : o 7.) (2-7) 
U,=exp( + : a 1), (2-8) 
U,=exp( — , oh) (2-9) 


where I is the angular momentum of all particles in terms of the particle coordinates 
T=S3(% py. pd, 


and & is a collective orientation angle, which is. generally a function of both positions 
and momenta of the particles. 


As was shown by Bohr and Mottelson,”” if I and @& satisfy the following conditions 


: (Pies =, (2-10) 


d 


* On this point, see reference 14). 
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I, }J=—i, (2-11) 
the transformed Hamiltonian and the subsidiary condition are 
Pee(WU,Uo Ue eT a ee r), 2-12 
1 1U 2: 2H 0 ( 0 28 ( a) 


I ye Catia Xs, @) =O. (2 ‘i 12b) 
This representation is the so-called collective representation, where rotational and intrinsic 


parts are completely separated. In fact, Hamiltonian H, contains neither & nor I, that 
is, it satisfies 


[H,, 0]=[H,,.1]=o. (2-13) 


This implies that we have displayed the dependence of the Hamiltonian (2.1) on the 
total angular momentum without making explicit use of the intrinsic variables which 
should be complicated functions of particle variables x, p,.* 


In this collective representation, ¥,, is written in the following form, 
3 : Ey, 
P,,' - Ceo X 4, () =1,(4) Q;(Xy°-°, %4) : (2 rs 14) 
Here the intrinsic wave function 9,(%,,---, %,) and the rotational wave function 7,(() 
satisfy the following equations respectively, 


as. 
Ay Os (High Je. (X77, ¥4) =O; (H,,"*+, ¥4), (GEESE) 


To, (x,,--, %4) =0, (2-15b) 
v5 
2A 


and the energy of the system becomes 


I 11(0) ee 


2 


Px), (2:16) 


1 
E,,=¢,+—!, (a2 
; 2 


as was expected in the beginning of this section. 

If we use the Hartree approximation in (2.15a) at this stage, the wave function 9; 
in (2.15a) which describes the intrinsic motion is replaced by the model wave function 
describing the independent particle motion in the deformed field. Then, in general, this 
model wave function cannot satisfy the subsidiary condition (2.15b). On the other hand, 
it is inevitable to use the model wave functions in order to give the explicit expression 
for the moment of inertia. In fact, if we adopt the correct wave function 9,, the ex- 
pressions for the moment of inertia, obtained by Inglis” and Skyrme” respectively, 


* In this representation the total angular momentum becomes 


—Ij)= (U,U2U3) 7} I(U,U,U3). 
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Je bet (i|I?|i) o\> 
) — (| H, GED ; 


f= 

2 0 
have no significance because of (2.15b). Here it is noted that the energy-eigenvalues 
¢, used in the above expressions are those of H,.. Consequently, such an introduction of 
the model wave function at this stage seems to contain a serious inconsistency in itself. 
It seems to us that these circumstances caused some confusions in the previous calculations 


of the moment of inertia. In § 3, we shall discuss this problem in detail. 


0 


§ 3. Intreduction of the model wave function 


In order to search for the representation which makes possible to introduce the model 


wave function, let us return to the representation of the second step, 


Fa =U, Ue FF (3-1) 
In this representation the Hamiltonian H, takes the same form as H,, 
Hy= (UU) Hy UUy= 5 I +(e 5 gt) (3-2) 
27 
but the subsidiary condition is 
(1, +1) ¥2% =0. (3-3) 
Let us further make a time dependent unitary transformation, 
P..=U™ ¥,,;™ ¢3*4} 
“‘Goe A 
U=exp( — ' EES Po) he : 
Wes wey: ar (3-5) 


The wave function @;, satisfies the following equations 


4 OD, 1 
}—_—_ =| 1 -—— Fw, , . 
iD Py ( ) 25 ) ? (3 6a) 
(1,+D9%,,=0, (3-6b) 
and the equation of motion of (/(t) is 
di (t) a es | 1 
. =y = me * a —— ny Pa . 
7 w slay o> A(t) A I, (3-7) 
The operator H,— 2 at in (3-6a) is time-independent, so that (3-6) satisfies the 
following eigenvalue equation, 
(H.-P r= Our (3 -8a) 
24 b % 


2 ee 
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(1, +1) @;,;=0. (3-8b) 


Here it is clear that the eigenvalue ¢; has the same value as that of €; in (2.2). Taking 


into account the subsidiary condition (3.8b), the equations (3.8) can be written in the 
form, 


i 
(H.-S D;)(X5°>°, X45 i) mn OD; (%°°, X 4) @), (3-9a) 


(+1) %;;=0. (3-9b) 


It should be noted that %;, is the eigenstate of the total angular momentum. 


On the other hand, however, the wave functions 


Lens Pog (Myers Ha) Xz (O) (3,10)% 


generally satisfy also the same equations as (3.9), 


1 2 =— 
(H.— ie \O= ey ?,, (3 s 11a) 
(+1 @;=0. (3-11b) 


Here 7,(0) and W,, are defined through (2.16) and (2.3), respectively, and a,’s are 
arbitrary coefficients of superposition. This wave function %; is not the eigenfunction 
of the total angular momentum. In fact, if we transform @; back into the original 
representation, it will be confirmed that the transformed @; is not an eigenstate of H,, 
but a superposition of the eigenstates of H,. In spite of this property, (), satisfies the 
subsidiary condition (3.11b). This is due to the particular choice of the present representa- 
tion. 

This fact assures the possibiliy of replacing the actual wave function %,, by the 
model wave function Y, which is not the eigenfunction of the total angular momentum, 
so long as we calculate the intrinsic energy €; characterizing the rotational band. Since 
the coefficient a; in @, is not restricted, it might be possible to replace @; by the model 
wave function @ used in the deformed potential model. Anyhow the model wave func- 
tion @” in the deformed potential model should be introduced only in this representation. 

From this point of view, let us consider the validity of the model wave function 
@™” used in the deformed potential model. The model wave function 0” (x, (#) may be 


introduced as follows. Remark that 
i ‘ sate 
(x, 0) = exp( — | 1 \0,(x, 0), (3-12) ** 
; D 


@, (x, 0) = Dla; Fez (%1,°°*, Xa), (3-12b) 
yi 


* Here %7*(0) is complex conjugate to %7(0). 
** x in @;(x, 0) stands for a set of %4,%2,°"+,%4- 
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and @;(x, 0) satisfies the following equations, 


(H.-F ).(x, 0) = .(x, 0), (3- 13a) 
. 24 
I, 9,(x, 0) =0. (3-13b) 


Here, when the model wave function @/"(x, 0) in the deformed potential model is 
adopted in place of ;(x, 0), DY (x, @) is given as 


OM (x, 6) =exp{ — ; Id OY (x, i... (3-14) 


Now, if we expand @* (x, 0) in terms of the complete set %,(x,,---, X4) which are 
the eigenstates of H, and I, @™ (x, 0) may be expressed generally as follows, 


OY (x, 0) =>) 4; P( #7, x4) (3-15) 
k 
= 516; 9;(x, 0) +terms not belonging to the rotational states. 
j 


Therefore, 
Qi (x, 0) =b, D;(x, 6) + S36, O;(x, 7) +terms not belonging 
Pet 
to the rotational states. (3-16) 


Here, we have taken into account that @;(x, 7) is a superposition only of the rotational 
states belonging to a particular band €; in all eigenstates of H). 

The second and third terms in the right hand side of (3.16), which are deviations 
from (3.10), decide the validity of the adopted model wave function @!(x, #). These 
terms have caused the serious confusion on the calculation of the moment of inertia. We 
shall discuss this point in detail in § 4. 

On the other hand, the above formulation gives new light upon the various methods 
with the deformed potential. At first, it is to be noted that the equation (3.11a) is just the 
same as those satisfied by the extended wave function F(x,,-:-, x4, ?) in Skyrme’s method”. 
This equation is the starting point of his variational method. In his method, emphasis 
is placed on a band of rotational states rather than a single state and there the additional 
collective coordinate # is not redundant. 


Next, if we multiply the equation (3.10) by 7%,(@) and integrate over #, we obtain 


Warley 84) =—\La(O) Oe (ayo, Ray OAD. (3-17) 
a 
This expression of ¥;, is simply that given by Peierls and Yoccoz’s generator coordinate 
method” when %;(x,,++-, 44, 4) is replaced by the P(x, @). However, it should not 
be confused that @” does not correspond to the Hartree approximation of Y,,, but to 
that of the intrinsic wave function @,,. 

Thirdly, let us consider the cranking model proposed by Inglis and others.0-” In 


this model the rotational motion is treated in a classical way. In our case this picture 
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corresponds to (3.6) and (3.7), where the rotational motion is described in the Heisen- 
berg picture. In the cranking model, emphasis is placed on the motion of ((t) and on 
setting up a deformed potential explicitly as an operator. Therefore, the equation (3.6) 
is written as follows, 


aD CM A A d , 
={ST ADV GI) 0% SH OM, (3- 18a) 
i t=1 


\i=1 


=. 
& 
| 

| 


(4+) 0 =0. (3-18b) 
The angular dependence of V is required to satisfy the condition, 
Pe tcone Feo Sen (3-19) 


In this case, it should be kept in mind that the Hamiltonian H°” is time-dependent. 
The equation (3.18a) is just the starting point of the cranking model. 
In the cranking model the calculation of the moments of inertia is performed on 


the body-fixed rotating coordinate system. This corresponds to the transformation, 


PRR (Os peu (3-20) 
R(t) =exp( —  1H(0)). (3-21) 
b ; 
Then (3.18) become 
ap a ake {Sor,+ SY, (r;, ¢;) = 1a (p! cel 3 -22a) 
t ra j=1 
I, Dp On ea) (3-22b) 


where w’=R “wa R. 
If w’ is treated classically, the moment of inertia is obtained from the calculation of the 
second order perturbation with respect to w’. It should be noted that the unperturbed 
wave function corresponds to the @i/(x, 0) in (3.15). 

Thus, it has been made clear that the model wave function must be introduced in 
the representation (3.4). In this representation the relation among the various methods 


with the deformed potential model becomes evident. 


§4. Expression for the moment of inertia 


Now, let us consider the expression for the moment of inertia A in the representa- 


tion (3.4). The following operator equations can be set up in this representation, 


1 Lise sg iL 

; —— I, 0 |=, Acad 

b [Hs 24 | A ae 
Aran ay (4-2) 


where @ is assumed to have the same functional form as that defined through (2.10) 
and (2.11). 
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Let us solve these operator equations by using D(x, 0) and O¥ (x, #). Then we 


obtain the usual expressions for A respectively””””,* 
Wi ir . 12 
Jars Mo! (4-32) 
ri Gs <5 
\/qya 1PM 12 
fear I P y, ; (4-3b) 
rae M M 
i+ €j — &i 


The model wave function @;, which is not an eigenstate of the total angular momentum, 
seems at a glance to give the meaning to (4.3a). However, (4.3a) has no meaning 


actually, because 
(@,|1|9;> =0 for ix<j 


as easily seen from (3.10). 

On the other hand, the model wave function @” in the deformed potential model 
has non-diagonal matrix elements of I, so that (4.3b) has a meaning. This circumstance 
results from the existence of the deviation terms in (3.16), which are due to the fact 
that @" is not exactly an eigenstate of (H,— (1 24) 1°). Therefore, the relation be- 
tween the real moment of inertia and $8 in (4.3b) becomes obscure. From this point 
of view, the validity of the cranking model formula for + may be doubtful. 

If we obtain the expression for 2 being meaningful even for 7, it may give the 
real moment of inertia in principle. 

. Another interesting expression for | +, which has significance even for @,, is derived 
as follows. Operating I> on the left through (3.11a), we have 


1 
I? H,——— I? )@, =, °° @,. 4-4 
( ) 24 0 ) € ( ) 
Therefore we get 
GH, |i) — *— GPT, {|i) =e CEN). 


24 


Taking into account the relation 


This expression (4:3) is derived through the following relation, 


(O.\ [LHe (1/28), 81, 9]0,) <2) [Ao— G 2A) Mo, 31 \05)1° 
e jx €)— 
Because explicit use is made of matrix multiplication, it must be guaranteed actually that the state 3O; 
remains in the space of the set of eigenfunctions O,. Therefore, strictly speaking, # in the above relation 
must be replaced by the periodic function in & with period 27, e. g., sin 3, and the expression of A, (4-3), 


pause be modified slightly. However, as discussed by Lipkin et al.,® the expression (4-3) holds approximately 
in the case where the deformed potential model wave function is used. 
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yy ~ 


i|Ty°|t), 
ms i, 


€;= (i|H,|i) he 


~we obtain 


(i|PH,\i) — G|P |G) Holi) 


Since @; satisfies the subsidiary condition (3.11b), (4.5) becomes 


Jai GERID— GOGH (455) 


go _ din =Grpe 7 
P=9 GP HI) —GEDGHD, ag 


This is just Skyrme’s formula, which reduces to that obtained by Peierls and Yoccoz 
under an appropriate approximation. It it to be remarked that if we use the real wave 
function @;,, this expression has no meaning. 

Now, it is clear that the expression (4.6) is more reliable than the cranking model 
formula (4.3). 


§ 5. Concluding remarks 


It has become evident that the introduction of the model wave function is necessary 
so long as we calculate the moment of inertia by the equation (4.3) or (4.6). The 
introduction of this model wave function is possible only in the representation (3.4). 
The relation among the various methods with the deformed potential model does not 
become clear till the consideration is taken into account in this representation. 

It should be stressed here that the above discussions were possible only on the condi- 
tions (2.10) and (2.11) which assure the separation of the real Hamiltonian into 
rotational and intrinsic parts. In other words, it is just this condition that guarantees 
the validity of the methods with the deformed potential model. Furthermore, it is to be 
remarked that the model wave function should be restricted in the form of @,; in (3.10) 
and the expression having significance even for @; should be used in calculating the 
moment of inertia. 

As mentioned in § 1, the conditions for the separation of nuclear Hamiltonian are 
connected essentially with the interactions between nucleons. Thus the significance and 
the importance of these conditions will be made clear in the next stage of the study of 
nuclear structure, in the sense that they may show a formal condition which is to be 
satisfied by the actual nuclear interactions. It will be the most profound problem to 


make clear this circumstance concerning the investigation of the origins of nuclear 


deformation and other fundamental properties of nuclei. 
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Influence of the Exchange and. Cor- 
relation on the Electron-Phonon 
Interaction in Metals 


Tomiyuki Toya 


The Research Institute for Catalysis, 
Hokkaido University, Sapporo 


August 21, 1958 


Recently Bardeen and Pines (B—P)” 
investigated the role of electron-electron 
interactions in the electron-phonon interac- 
tion in metals by extending the Bohm- 
Pines collective description method. The 
conclusion is that the exchange and corre- 
lation effects on the dispersion relation 
w(q) and on the effective matrix element 
V,,(q) for electron-phonon interaction is 
negligible for small wave number vector q 
of a phonon. 
might, however, be 

According to B—P 


the effective matrix element V,,(q) is given 


The conclusion 


questioned as follows. 


without the exchange and correlation effects 


as 
V,(q) =V.(q) {1+ 87/9 
x 3 &) Ex(k+9) Be)“ 


where V;(q) is the matrix element of the 
electron-phonon interaction, and E,(k) is 
the energy of an electron of the wave 
number vector k, including the coulomb 


energy due to other electrons. The com- 


= 11 ior 2/3 (cy =o) Thee 


now formulated” in terms of the above 


pressibility 


V,,(q) by expressing elastic constants ¢, 
and ¢,,—¢, by the dispersion relation w(q), 
which includes V,,(q), in the long wave- 
length limits; the result is 


(1/«) n= N{—2V (r,) —E,(k=0)) 
+2/3-€,+0.2e7/7,}, (2) 
where N is the number of valence electrons 
in unit volume, ¢, the Fermi energy, V (75) 
the Hartree field at the surface of the 
atomic sphere and r, its radius. 
The compressibility is, on the other hand, 


derived from the cohesive energy E, ac- 
cording to the Wigner-Seitz theory, as” 


Lje=(1/12e1,)d*E fdr, 
= (1/«) y—N[0.204 e?/r, 
+ 0.064 e?/r, {1+ (1+5.14)/r,)~'} 


S(T ES td, ft) * 1), (3) 


referring to the equilibrium condition 
dE,/dr,=0, and the relation, dE, /dr,= 
3/r,§(V(r,) —E(k=0)) —0.2 e*/r;}, deriv- 
ed by Froelich” and Bardeen.” 
(3), a, is the Bohr radius and the first 


In eq. 


term (1/K) 7 is given by eq. (2), while 
the second term, arising from the exchange 
and correlation terms in E,, is not at all 
negligible ; e. g. for Na the first or second 
terms is 1.39-10" dynes/cm® or 0.67-10" 
dynes/cm*, respectively and hence 1 /K 
6.72710" 0.80-10" 


dynes/cm’, taking into account the contti- 


dynes/cim? or 
ynes/ 
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bution from the ion-core exchange repulsion, 
compared with the experimental value 
~0.80-10'' dynes/cm’. 

We are thus led to the conclusion that 
w(q) and V,,(q) must include the exchange 
and correlation terms even for small q at 
variance with that of B—P. 

Present author has included the ex- 
change and correlation effect in the electron- 
phonon interaction by extending the Hartree- 
Fock method and taking the configurational 
Maecke’s 


method; the effective matrix element is 


interaction into account by 


thus found to be, 
V,(q) =V;.(q) {1+ (87e/¢° 

—2B(q) ze’/k;) 

x Dk) (E(k+q) —E(k)) “}", (4) 
instead of eq. (1), where E(k) is the 


energy of an electron including the ex- 
change and correlation energy due to other 
electrons and &, the wave number of an 
electron at the Fermi surface. The term 
B(q), which distinguishes the above equa- 
tion from eq. (1), is rather complicated 
but is estimated approximately to be 1.25 
—0.25 ¢°/gi.ax for Na. 

B(q) is physically interpreted as giving 
the exchange and correlation effects which 
reduces the shielding field as follows. The 
charge density of electrons is redistributed 
by crystal deformation to shield the perturb- 
ing ionic field. Electrons tend, however, 
to be apart from each other by exchange 
and correlation interactions to reduce the 
shielding effect compared with the case of 
their absence. The shielding effect is further 
reduced through the term >(E(k+q) — 
E(k))~" in eq. (4), which replaces S'(E, 
(k+q) —E,(k))= in eq. (1), as seen 


from the approximate relation, 


S(E(k+q) —E(k)) 
= D/D,S)(E,(k+q) —E,(k))~“, (3) 


where D or D, is the density of states with 
or without the exchange and correlation 
energies and D~ D, for monovalent metals. 
The contribution of the exchange and 
correlation energies to the compressibility 
is derived from the dispersion relation «(q) 
and eq. (4) as 
— N{B(q=0) -0.204e*/r, 
+2/3-£,(D'D,—1)}, (6) 


which is numerically equal to the second 
term of eg. (3) for Na with D’D,=0.90” 
and B(q=0) =1.25. 

The validity of eq. (4) for g@ 0x is 
on the other hand examined as follows. 
For the mode q= (7/7, 0, 0), e,= (1, 0, 
0) or q=(0, Z/r,, 0), e,=(1, 0, 0) in 
the metals of b.c.c. structure (27,: lattice 
const.), each ion is displaced alternately 
by +0R or —OR in the direction of x- 
axis from the equilibrium position, and 
hence the contributions from the valence 
electrons to the adiabatic potential may be 
calculated by the cellur method,” admitting 
that the valence-electron coulomb energy is 
approximately balanced by the exchange 
and correlation energies just as in perfect 
crystals. The circular frequency is thus 
found to be 1.64(Ne* M)'? (M; the 
atomic mass), in comparison with 1.60 
(Ne*,M)'* as obtained by the dispersion 
relation and eq. (4). The appropriate 
adiabatic potential amount to 50%, larger, 
if exchange and correlation effects are ignored. 
1) J. Bardeen and D. Pines, Phys. Rev. 99 (1955), 

1140. 
2) T. Toya, Busseiron Kenkyu 60 (1953), 129, 
3) H. Froelich, Proc. Roy. Soc. 158 (1937), 97. 


4) J. Bardeen, J. Chem. Phys. 6 (1938), 372. _ 
5) D. Pines, Phys. Rev. 92 (1953), 39. 
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Electrical Conductivity of 
Monovalent Metals 


Tomiyuki Toya 


The Research Institute for Catalysis, 
Hokkaido University, Sapporo 


August 21, 1958 


The interaction constants C of monovalent 
metals are evaluated from the effective 
matrix element of electron-phonon interac- 
tion derived by self-consistent field method, 
taking the exchange and correlation effects 
into account. 


With the wave function, 
gv (k) =exp (ikr) u(t), (1) 


of an electron of wave number vector k 
and of its Hartree energy E,(k) =£,+ 
i? /2m-k’, the effective matrix element V,,.’ (q) 


of electron-phonon interaction is given as”? 


V°(q) =Vi(q) A+S(q))", 
where 
S(q) = (8 7e?/q?— 2B(q) 7e*/kr) 
x D/DyS) (k) (E(k +9) — Eo (k)) ay 
(3) 
and,” 
Vi(gy=in"(e,, @) {47Ne?/q° 
47 (r) —Ek=0))}9@)3 (4) 


e, is the displacement vector of ions, 
g(x) = 3 (sinx —x cosx) /x*, x=1,q, V(t.) 
is the Hartree potential energy at the 
surface of atomic sphere of radius r, and 
y=uz(r,). D or Dy is the density of 
states at the Fermi surface with or without 
exchange and correlation energies, and the 


term B(q) in eq. (3) allows for the effect 


of the exchange and correlation energies on 
the screening field, as discussed in the 
foregoing letter.’ 

Wave function (1), on which eqs. (2), 
(3) and (4) are based, is, however, known 
as a poor approximation in the case of Li; 
the approximation might be improved by 


using the function, 
g (k) =exp (ikr) {u)(r) +i(kr)v(t)}, G) 
and its energy E,(k) =E,+ab’/2m-k’, for 
calculating the matrix elements, where @ 
1S given as 
a— =u) (r,) (dv/dr) range: 
We have thus, approximately, 
V(q) =Vi (q) + (k/ke)* sin’ (qk) Vi (q), 
(6) 
where V;(q) is given by eq. (4) and 
V!(q) =iN-'? (y—@) (€,q) (6°/2m kr’) 
x 6/x2(y (x) —sinx/x), 


and k, is the wave number at the Fermi 


surface. The effective matrix element is 


derived self-consistently as, 
V,(q) = Vi (q) 
—V;(q)T (q)) (1+5(q))~ 
+ (k/kn)®sin?(qk)Vi(q), (7) 
where 
T (q) = (8 7e"/q?— 2 Bre*/ky") 
x D/D, >) (k/kr)* (En (k +9) 
—E,(q))~: 


The terms omitted in eqs. (6) and (7) 
are proportional to gq’ and are negligible 
for ¢<max- 

The interaction constants C are now 
evaluated as follows: B(q) are determined, 


when we assume each of them, respectively, 
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Table I 


10.65 22.9 18.2 8.6 6.0 53.8 


at Gavel ge scsi) | (250) (150) (80) (50) (35) (273) 
rs (in atomic unit) 3,22 3.96 4.87 5.18 5.57 2.68 
a=m/m* | 0.69 1.0 1.0 1.0 1.0 1.0 
ya) 0.31 0 0 0 0 0 
fy (eV) |, “ssi 3.17 2.10 1.86 1.61 6.95 
B(q=0) } 1,32 1.25 1.21 1.18 1.18 1.37 
D/D)” | 0.837 0.90 0.935 0.945 0.965 0.874 
V—Ey (eV) —0.80 0.08 —0.02 0.03 0.00 37 
C/éo (from eq. (2) or eq. (7)) | ach 1.08 £12 1.20 1.20 Lisi 
Cleo (Hartee approx.) 0.72 0.75 0.77 0.80 0.82 1.12 
O°K / 330~360 180~202 114 65 45 320 
C/o (experiment) | 1.45~1.6 | 0.98~1.08 0.98 1.32 1.29 1.51 


a) We assumed a=y=1 for metals except Li. b) D. Pines, Phys. Rev. 92 (1954), 626. c) FS. 
Ham, Solid State Physics, Vol. I (1955), 185. For the value of Cu, see the following letter in this issue. 
d) See Ref. (2). 


being approximately constant at q=0, and 1) T. Toya, Busseiron Kenkyu 59 (1952), 179; 
ibid., 99 (1955), 33; Prog. Theor. Phys. 19 
(1958), this issue. 

2) J. Bardeen, Phys. Rev. 52 (1937), 688. 

are the contribution of the exchange and 3) A. Sommerfeld & H. A. Bethe, Handbuch der 

correlation energies to the compressbility.”» Physik, Vol. 24, Il (1933), 513. The conclu- 

The Umklapp processes are taken into ac- sion of E. J. Blace (Phys. Rev. 99 (1955), 1735) 

or R. Barrie (ibid., 103 (1956), 1581) is not 


correct. Cf. M. Tsuji, J. Phys. Soc. Japan, 12 
Table I shows the calculated values of C (1957), 828. 


equating the second term of eqs. (4) and 
(6) of the foregoing letter, which both 


count in the same way as done by Bardeen.” 


and associated quantities in comparison with 
its experimental values C,.,,, which have 
been worked out from the observed conduc- 
tivities (7) according to the equation,” 
Normal Vibrations of Copper 
Cop =const.ky (dE /dk) ° MO(To)—, 
where M is the atomic mass, @ the Tomiyuki Toya 
Debye temperature, and (dE/dk) ,=D/D, 
(4E,/dk) p. 

As seen from Table I, the theoretical 
values are considerably improved compared 
with those of the Hartree approximation,” 


The Research Institute for Catalysis, 
Hokkaido University, Sapporo 


August 21, 1958 


by taking the exchange and correlation Frequency vs wave vector relations for 
effects into account. Cu are calculated in the (100), (110) 
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and (111) direction on the basis of the 
dispersion relation, 


Mo (q)"=— 2: {V7,* (q+K,) 
XV; (q+K,) /(E(k+9q+Ki) —E(k)) 
+V;*(q+K,) 
XV, (q+K,) /(E(k—-q—-K,) -—E(&))} 
+42Ne?/3+H,+H,, (1) 


which have previously been derived from 
the first principles by the present author 
for the calculation of the normal vibrations 


of Na.” 


mass, w circular frequency of the longitudinal 


In eq. (1), M is the atomic 


or transversal lattice vibration of the wave 
‘sumber vector q in the principal directions, 
K, a vector of the reciprocal lattice space, 
V,(q+K,) and V, (q+K,) matrix elements 
of electron-phonon and effective electron- 
phonon interactions respectively, E(k) the 
energy of an electron of wave number k, 
and N the number of valence electrons in 
The third and fourth terms, 
H, and H,, are the contributions of 


unit volume. 


potential energies of the coulomb and ex- 
change repulsions between ion-cores respec- 
tively. 

The first term represents the contribution 
from the electron-phonon interactions, in- 
cluding electron-electron interactions. The 
summation extends over the valence electrons, 
and Umklapp processes K,, ~0as well. The 
second term covers the second order pertur- 
bation of the ionic field by lattice defor- 
mation. 

The first term in eq. (1) is evaluated 
assuming a=m/m*=1, y=, (r,) =1 and 
V (r,) ~E(k=0) =3.7 eV. (notations are 
the same as those in the foregoing letters”) . 
These assumptions are reasonable except for 


-electrons near the Fermi surface on the 


ground of the recent theoretical calculations 
of the electronic structure of Cu. We 
have, besides, D/D,=0.875 and B=1.37, 
similarly as in the foregoing letter.” Cor- 
rection term 2.90-107* 7,4(9,'+4,'+4.') 
—2.06-107" r,‘|q\* is added to g(x) in 
the expressions V; and V,,, as to satisfy 
the condition V,;(K,) =V,(K,) =0 for 
Kone Cine 2) sand RGSS, OF 
0) (2r,; the lattice const.). H, is readily 
computed by Ewald’s method,” its contri- 
bution to the elastic constants ¢,,—Cy and 
C4, being found identical with the results 
of Fuchs,” as determined in the long wave- 
length limits of transversal vibrations. We 
compute H, assuming the exchange repul- 
sions U(r) between ion-cores to be the 
central force potentials of the Born-Mayer 
type, as 

U(r) =0.120 exp 11.5(r,—1r)/reeV, (2) 


where 7,=) 2 7% is the distance between 
nearest neighbours in the equilibrium con- 
figuration of the lattice. 

Frequency vs wave vector relations are 
evaluated with the above values, as shown 
by solid curves in Figs. (b), (2), and, (3)s 

of 


100 
6 
5 L 
Yn 
O 
eat 
= 
x 
3 
2 
1 
1.0 2.0 2.76 
<107°CM"' 
Fig. 1 
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x 10'CPS 


1.0 2.0 ZiQS. 


<10'CPS 


10 2.0 2.38 
MILOICME 
Fig. 3 


Crosses in the figures show the experimental 
results of Jacobsen” determined from the 
temperature diffuse scattering of X-rays ; 
two constants of eq. (2) have been adjust- 
ed to the experimental values at the largest 
q in Big. (3). 

The elastic constants ¢,,, Cy and ¢,,—¢p» 
are estimated approximately to be 14.0, 
8.2 and 4.6 in units of 10" dynes/cm? 
respectively. The electronic term compen- 
sate 80% or 20% of the contribution from 


the coulomb terms (the second and third 
terms of eq. (1)) to Cx, or ti.—fa “The 
value of c,, is somewhat small compared 
with the experimental value. The dis- 
crepancy might be attributed to the simple 
assumptions for the electrons near the 
Fermi surface as well as for ion-core ex- 
change repulsions given by eq. (2). 

The author expresses his sincere thanks 
to Professor J. Horiuti for his continual 
encouragement and interest shown to the 
present three works. 


1) T. Toya, Busseiron Kenkyu 60 (1953), 129; 
cf. also J. Bardeen and D. Pines, Phys. Rev. 99 
(1955), 1140 and T. Toya, J. Research Institute 
for Catalysis Vol. 6 (to be published). 

2) T. Toya, Prog. Theor. Phys. 19 (1958), in this 
issue. 

3) K. Kambe, Phys. Rev. 99 (1955), 419; M. 
Fukuchi, Prog. Theor. Phys. 16 (1956), 222; 
P. J. Howrth, Proc. Roy. Soc. A 220 (1953), 
513; Phys. Rev. 99 (1955), 46. 

4) P. P. Ewald, Nachr. Akad. Wiss. Gottingen, 
N. E. IL? 3 (1938), 55. 

E. W. Kellerman, Phil. Trans. Roy. Soc. London 
A 238 (1940), 513. 

5) K. Fuchs, Proc. Roy. Soc. A 151 (1935), 585; 
A153 (1935), 622; A157 (1936), 444. 

6) E. H. Jacobsen, Phys. Rev. 97 (1955), 654. 

7) Cf. C. Zener, Acta cryst. 3 (1950), 346; H. 
B. Huntington, Phys. Rev. 91 (1953), 1092. 
In the case of Na, the contribution from the 
first term to shear moduli is negligible as 
assumed by Fuchs (cf. ref. (1)). 


A Note on the Elastic Seattering of 
High Energy Deuterons by 
Complex Nuclei 


Yoshihiko Nishida and Hiromi Tanaka 
Department of Physics, Saga University, Saga 


September, 18, 1958 


Recently it was pointed out by Sakamoto. 
and Sasakawa’ that a faithful application. 
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of the impulse approximation to elastic 
scattering of high energy deuterons by 
nucleons leads to a more excellent agreement 


than Chew’s 


result.” As for elastic. scattering by complex 


with experiment original 
nuclei, however, the impulse approximation 
will not be so good. The analysis by 
Baldwin et al.” shows that as smaller 
scattering angles the differential cross section 
by the impulse approximation is fairly 
larger than the observed one, but with 
increasing scattering angles the former 
becomes conversely much smaller than the 
latter. The success of the impulse ap- 


proximation in nucleon-deuteron — elastic 
scattering suggests that most part of this 
Jarge disparity between theory and experi- 
ment may be due to the simultaneous 
scattering of both nucleons of deuterons by 
Now, Akhieser and Sitenko” 


(hereafter referred to as A—S) recently 


nuclei. 


proposed the theory of diffractional scatter- 
ing of high energy deuterons by nuclei. 
Their theory corresponds to a generalization 
of the optical model to the deuteron- 
scattering. It is very clear-cut in principle 
and the effect of simultaneous scattering is 
straightforwardly introduced. Therefore, we 
may consider their theory as an improve- 
ment on the impulse approximation. Then, 
it comes into question to investigate how 
far theory is available in practical analyses. 
In this note we shall report some results 
obtained by applying the 4—S theory to 
the elastic scattering of 156 MeV deuterons 
by carbon.” On the other hand, the effect 
of simultaneous scattering was independently 
discussed also by Stapp” with a quite 
different idea. Therefore, the following 
discussion will naturally be developed in 
close contact with his theory. 

First of all, it might be pointed out 


that although Stapp’s theory is developed 
in a laborious manner compared with the 
A—S theory, both theories lead to nearly 
The W.K.B. approxi- 
mation on which the A—S theory is based, 


the same results. 


well reproduces the nucleon-nucleus scatter- 
So, it will be 
permitted within the frame of the A—S 


ing data at high energies. 


theory to replace the nucleon-nucleus scatter- 
ing amplitude by a suitable empirical 
formula. To compare with the result of 
Stapp’s theory, it is desirable to use the. 
same empirical formula for the nucleon- 
nucleus scattering amplitude as he did. 
Indeed, we also use the form obtained by 
Greider” to fit the cross section data of 
Strauch and Titus for 96 MeV protons on 
carbon. The scattering amplitude for 
simultaneous scattering is calculated with 
the assumption that the phase in the 
nucleon-nucleus scattering amplitude is angle- 
independent and with the Gaussian type 
When 
the spin-dependent effect is omitted, the 
result shows that [’(x) and I’(x, y) ap- 


pearing in the scattering amplitude in 


of S-state deuteron wave function. 


Stapp’s theory have only to be replaced by 
I(x) and I(x, y), respectively. There are 


simple relations between them : 
I(x) I(x) [Ret 4x sin? § 6/Ri)", 
I(x, y) ~I'(x 9) 
x[R2+2(x+y) sin? § O/R/)". 


Here R,=2.64fermi is the deuteron 
radius and @ is the deuteron-scattering 
angle. However, the effect of this substitu- 
tion is easily seen to be negligibly small 
at the scattering angles less than 30°. 
Therefore, we shall find it convenient to 
use the A—S theory for a practical calcu- 


lation. The differential cross section 
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obtained using the A—S theory is compared 
with the result by using the impulse ap- 
proximation in Fig. 1. 

Next, we calculate the differential cross 
section using a more familiar deuteron wave 
function ¢(r) = a/2me-*"/r to make a 
detailed comparison with experimental data. 
Difficulty in the analytical integration in 
this case be avoided by making a suitable 
approximation in the integrand : 


4a/k arctan k/4a~p, exp(—Ak/2) 
+ ps exp (—?/2) +p; exp(—rk*/2) 


differertial cross section (mb/ster) 


scattering angle (degree) 


+ + + + + Experimental points. 

netovasseneeeanene Impuls approximation for Gaussian 
deuleron wave funtion. 

telat telatalel Theoretical curve for the Gaussian 
one. 

soe Impuls approximation for the fa- 
miliar one. 

———_ Theoretical curve for the familiar 
one. 


Fig. 1. Angular distribution of 156 MeV 


where p,=0.59, po=0-10, p3=0.31 
4=0.8 fermi®, “=0.2 fermi’, 
y=0.01 fermi’. 


The theoretical differential cross section is 
in good agreement with the observed one 
at smaller angles, and even at larger angles 
the former is smaller than the latter, at 
worst by the factor of 1.4 only. About 
It might be 
mentioned that the angular distribution at 


the detail, see also Fig. 1. 


smaller angles is insensitive to the form of 
sticking factor. 

Finally, the comments will be made on 
the assumption of angle-independent phase 
and the limit of applicability of the A—S 
theory. In the W.K.B. approximation, 
the phase of the nucleon-nucleus scattering 
amplitude in this case increases very slowly 
with the scattering angle for the angles 
less than about 30°. So, the above as- 
sumption will be a good approximation in 
this region. This means that only for the 
deuteron-scattering angles less than about 
15° the assumption of constant phase is 
It will be 


understandable that for the deuteron-scatter- 


a reasonable approximation. 


ing angles over about 15°, the theoretical 
differential cross section deviates from the 
observed one. In this region, however, the 
W.K. B. approximation for the nucleon- 
nucleus scattering will be less valid. So, 
it will be unsuitable for improvement of 
the situation to use the angle-dependent 
phase by the W. K. B. approximation. We 
would rather consider that this region lies 
outside the limit of applicability of the 
A—S theory. To obtain a satisfactory 
agreement with experiment including the 
polarization effect in this region, it is 
necessary to apply an improved approxima- 
tion on the W.K.B. one. It might be 
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emphasized that the A—S theory (and also 
the Stapp theory) will be available enough 
to improve the impulse approximation only 


for smaller scattering angles. 


1) Y. Sakamoto and T. Sasakawa, Prog. Theor. 
Phys. 19 (1958), 745. 

2) G.F. Chew, Phys. 74 (1948), 809; 80 (1950), 
196. 

3) Baldwin, Chamberlain, Segre, Tripp, Wiegand 
and Ypsilantis, Phys. Rev. 103 (1956), 1502. 

4) A. IL. Akhieser and A. G. Sitenko, Phys. Rev. 
106 (1957), 1236. 

5) H. P. Stapp, Phys. Rev. 107 (1957), 607. 

6) K. Greider, unpublished result. See (60) (61) 
in Stapp’s paper. 


Some Remarks on the Scattering 


Problem 


Masakuni Ida 


Department of Physics, University of Kyoto, 
Kyoto 


October 2, 1958 


Inyalidity of Levinson’s theorem in meson 
scattering. 

We know Levinson’s theorem” in quan- 
tum mechanics which states under certain 


conditions, 
3(co) —8(0) =m", 


where O(p) is the scattering phase shift 
for momentum p of the scattering particle 
and m the number of bound states. It is 
often applied to the problem of meson 
scattering. But it is essential in the deri- 
vation of this theorem that the scattering 
is due to a local, velocity independent 


potential. Since the scattering of mesons 
cannot be described as the one due to such 
a potential, it is doubtful whether the 
theorem keeps its validity for meson scatter- 
ing. Indeed, we can show that this is not 
the case. 

In a simple model presented by Dyson,” 
assuming that the square of the cut-off 
factor is of the order of 1/p***, with s>0, 
as poco, we obtain for N —@ scattering 


the relation 
r==——-{0(co) 260) =n 11," 


when n is the number of zeros of the func- 


tions 4(z) defined below : 


sft jee gpemnel sin oO 
A(z) =>) Bey Hg sin’ 0 (q) ' 
t=1 OR, <% i ial CPE are 
p 
15,20, onc, = Ag). 


1p, defines the effective coupling of the 
i-th bound state B, with the energy my 
Wp, to the N—@ system. my and // are 
the masses of the N and 0 particles, re- 
spectively. The cut-off factor, v(q), 1s 
normalized in such a way that v(0) =1. 
The parameter o=m-+7, indicating the 
deviation from Levinson’s theorem, deter- 
mines the number of zeros of h(z), accord- 
ingly the form of the solutions of the Low 
equation.” In other words, o is the para- 
meter concerned with the selection of 
redundant solutions” of the Low equation 
for this model. o equals the number, 
n+1, of the bare V particles which cannot 
be directly observed. We notice that o 


itself is an observable quantity. 


* When h(y) =o, then r=nt+lo—m. A teso- 
nance occurs at zero kinetic energy. For the sake 
of simplicity we do not consider this case in this 


letter. 
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Levinson’s theorem is in general invalid 
for the charged scalar meson theory” also, 
and perhaps: for the symmetric pseudo- 
scalar one.” The theorem loses its kinema- 
tical character in the case of meson scatter- 


ing. 


Positive and negative resonances. 

We classify the resonances into two 
types, positive and negative, according to 
the sign of dd(p)/dp at the resonant 
energies. If we denote the number of 
resonances of respective types by 7, and 


r,, then the relation 
Tn, ty 

holds generally under the elastic approxima- 
tion and the assumption that the cut-off 
factor damps sufficiently for high energies. 

An example of negative resonance is 
‘obtained for the charged scalar meson 
theory with the one-meson approximation. 
In the Serber-Lee solution® there is a 
bound state in the 7*-+p system for the 
of the 


constant Ay > 1. 


square meson-nucleon coupling 
Corresponding to this 
bound state there appears a resonance of 
the negative type at the meson energy 
wo=sy fl. 

Resonances of the two types have some 
mutually contrasting properties : 

a) When we analytically continue the 
S-matrix downwards on the right side of 
the branch point at w=y, there usually 
appears in the lower half of the complex 
w plane a pole of S(w) corresponding to 
a positive resonance,” but it seems rather 
usual that there is mo such pole for a 
negative one. 

b) In the resonant scattering of a wave 
packet, the packet is delayed by a positive 
resonance, but is advanced by a negative 
one. 


c) The width of a negative resonance 
has a lower bound in order to be consistent 
with the requirement of causality. 

The well-known 3-3 resonance in the 
low energy 7—N collision is of the positive 
type. It may be interesting whether nega- 
tive resonances appear in the higher energy 
region. 

Details of this letter will be reported 


before long in this journal. 


1) N. Levinson, Kgl. Danske Videnskab. Selskab, 
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3) F. E. Low, Phys. Rev. 97 (1955), 1392. 

4) L. Castillejo, R. H. Dalitz and F. J. Dyson, 
Phys. Rev. 101 (1956), 453. 

5) G. F. Chew and F. E. Low, Phys. Rev. 101 
(1956), 1570. 

6) S. T. Ma, Rev. Mod. Phys. 25 (1953), 853. 
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Added note. After communication to the editors 
I became aware of the paper by Martin,* in which 
he had studied in detail the validity of Levinson’s 
theorem for the scattering due to a non-local but 
separable potential and shown that this theorem is 
valid only when there is no degeneracy for positive 
energy wave functions. 


Superconductivity of Ferromagnetic 
Metals 


T. Kasuya 


Physical Institute, Nagoya University 


October 3, 1958 


In rare earth metals, Lanthanum exhibits 
superconductivity with the transition tem- 


* A. Martin, Nuovo Cimento, 7 (1958), 607. 
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perature 4~6°K. However, the other 
elements do not exhibit superconductivity. 
This phenomenon is explained by the 
recent theory of superconductivity by Bardeen 
et al.” and the so-called s—d interaction 
investigated by us.” 

The Hamiltonians of electron-phonon and 


s—d interactions are written as 


H,,= pap 32 {C (q) dj. +4, vk. te 


we G) aj: wah eq bs* } (1) 
A,a= ee Ly Gy Nee 


x tia (di.gi+ ak + agin tne) 
he Gineg. te jn die. Onta,—} (2) 


where C(q) is the coupling constant of 
electron phonon interaction, J(q) the ex- 
change energy between conduction and un- 
filled inner shell electrons, ajX,, ay, the 
creation and annihilation operators of electron 
with wave vector k, spin » and 6,*, 6, the 
operators of longitudinal phonon with wave 
vector q. 

For the nearly degenerated electronic 
states near the Fermi surface, the effective 
Hamiltonians of binary electrons interaction 
are deduced from the second order pertur- 


bation process as follows, 


HY =— PAPI IP APs V .»(Q) 
ktkl q v 


X aif av Mev Gere Ter qu (3) 
H?=— SS V 5a (q) 
ktkr @ 
X dh, q,— te, + Ger, + Mer +q.—* (4) 


In the calculation of H,?, we used the 
molecular field approximation, and neglected 
the first term of eq. (2) because this term 
has the same character as that of ordinary 
impurity scattering, t that is, the freedom of 
Jn is not concerned. Further, neglecting 


the energy differences between electronic 


states in initial, final and intermediate states 


compared with the energies of phonons and 
the molecular field of j,,, we write V,,, and 
Vig) as 


Vy (q) =\C (q) |?/bag (5) 
V aq) =2(9—1)? jn? (q) /NHn (6) 


where w, is the angular frequency of the 
longitudinal phonon with wave vector q, 
ihe the mean value of j,’, and H,, the 
molecular field. 

As Bardeen et al. have shown, the im- 
portant terms for the superconductivity are 
Then, 


considering the screened Coulomb interac- 


of the type of aft. a™ 4 _yr,_ nr, 5 « 


tion, the Bardeen type Hamiltonian becomes 


H?= =. =>! eT Laiges 


Reh 


* 

4 42k, -4_k),— Ue + ° 
(7) 

As is seen from the above formula, H,? 


has the effect to diminish the effect of 


H®, and thus if the s—d interaction is 


Are ee 
de 
ket re.) st 


sufficiently large, the superconductivity is 
destroyed. 
Now let us calculate the order of magni- 


With the 


use of Bardeen’s coupling constant of 


tude of each term in eq. (7). 


V,,, becomes 


ep 


electron phonon interaction,” 


ace te Re (8) 


where ¢ is the Fermi energy, M the mass 
of the ion and , the velocity of the 
longitudinal phonon. For example, using 


the data of Gadrinium, that 1s, 
Nee 3-07 S010" Coc won le eon) Us aeete 
M=2.6 X10~™ gr. 


typ=1.7X10° cm/sec (9=160) 
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J(q)=J (0) =2.5 X 10-* erg. 
3 


kl f= 0-/9 XK 10~ | erg. 
Ga) 


(9) 


where we used the free electron approxima- 
tion and assumed that n/N equals 3, we 


obtain 


=12) 1 
V g=4.2X 10 —— erg. 


Ney 
[ne <0 or erg. 


2 2 
ee Tig ere 
N 


Beorg’ + -(2h,)° 


(10) 


where k, is the wave vector at the Fermi 
surface. 

Thus we may say that Gd does not exhibit 
superconductivity. Also in other elements 
of rare earth metals except La, the situa- 
tions are nearly same as that of Gd and 
we may say that they do not exhibit super- 
conductivity. 

In transition elements of Fe-families such 
as Fe, Ni, etc., s—d exchange integral ](0) 
becomes much larger than that of rare 
earth metals and it is consistent with the 
fact that they do not exhibit superconduc- 
tivity. 

In transition elements much heavier than 
rare earth metals of La-families, ](0) be- 
comes smaller and thus in some elements 
there would appear superconductivity. 

Finally, the auther expresses his cordial 
thanks to Asso. Prof. S. Nakajima for 
valuable discussions. 


1) J. Bardeen, L. N. Cooper and J. R. Schrieffer, 
Phys. Rev. 106 (1957), 162. 
N. N. Bogolubov. 
2) 'T. Kasuya, Prog. Theor. Phys. 16 (1956), 45. 
3) J. Bardeen, Phys. Rev. 52 (1937), 688. 


Proton-Proton Scattering in the 
Gev Region 

Criticism of the Blockhintsey-Bubeler 
Model 


Junko Oba* and Tetsuo Goto** 


*Department of Physics, St. Paul’s University 
** Department of Physics, College of Science 
and Engineering, Nihon University 


July 28, 1958 


According to the recent development of 
experimental analyses in the Gev region, 
“the nucleon structure” has become an 
important problem and the existence of 
“the nucleon core” has been indicated by 
the experiments. 

Blockhintsev and Bubelev dealt with 
nucleon-nucleon collisions using the Wil- 
liams-Weizsacker method and in_ their 
treatment the central collision was considered 
as a matter of little importance. 

Now we want to examine the Block- 
hintsev-Bubelev model in detail. That is, 
we compute, not only the total inelastic 
cross section, but also the cross section 
producing two or more pions. In this case, 
the inelastic cross section rapidly changes. 
with the lower limit of the impact para- 
meter, i.e. the cut-off parameter and with 
When Bubelev’s 
cut-off parameter is used the branching ratio 


of the production cross sections of one 


the coupling constant. 


pion to two or more pions turns out to be 
larger than the experimental value. More- 
over, we are able to calculate the elastic 
cross section from the inelastic cross section 
by making use of the impact parameter in 
this model. Here we used the assumption 
that Its and Minami postulated, i.e. the 
most part of the elastic scattering was due 
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Fig. 1. The elastic cross section. Fig. 2. The inelastic Fig. 3. The cross section for 


cross section. 


to the shadow scattering. Using the former 
cut-off parameter, we obtain the result that 
the elastic cross section also gives a much 
smaller value than the experimental value. 
Thus, the fact that the experiment gives not 
a larger value than the computed value 
shows that this cut-off parameter is not 
suitable as long as we take g?/bc~15. 
Though the validity of the Williams- 
Weizsacker the 


method of clouds of pions is open to 


the 


method or calculation 
question, we do not discuss these problems 
now. Thus Cees, Tina. and Os, which 
correspond to various cut-off parameters are 
estimated respectively. The result is shown 
in Figs. 1~3. by 

The experimental results are shown by 
the dotted line. 
where R is b/m,,c and r, is the lower limit 


Judging from the 


In the figure, a=r,/R, 


of the impact parameter. 


above figures, the inelastic total cross section 1) 
appears to be explained with Bubelev’s cut- 
off parameter, where a=0.3 corresponds to 3) 
Bubelev’s cut-off parameter. However, we 
Errata 


Dispersion Relations in Nucleon-Nucleon Scattering 


Sadahiko MATSUYAMA and Hironari MIYAZAWA 


Equations (6-4) and (7-1) should read 


cannot understand at all the elastic cross 
section and the production cross section of 
two or more pions with a=0.3 


fore, contrary to Bubelev’s model, we do ' 


central collision is negligible, and it is taken 
as an adjustable parameter. 
account the contribution from the central 


collision, of which the maximum value is 
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